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(57) Abstract: In accordance with the present invention, there arc provided selection systems and methods for screening for agents 
that control splicing of intcins in their native host protein (cxtcin) or in homologous cxtcins. Specifically, there arc provided positive 
CO genetic selection systems for the screening of agents which inhibit or activate protein splicing which comprise: a host cell containing 
22 a chromosomal gene encoding either a drug-resistant form of a target enzyme or a wild-type target enzyme, and a plasmid-bome 
^i-j gene encoding either a drug -sensitive form of the target enzyme, which is dominantly cytotoxic upon interaction with the drug, or a 
dominantly cytotoxic form of the target enzyme. In these systems the plasmid-borne gene contains an intein, and the inhibition or 
activation of splicing of the dominant cytotoxic form of the target enzyme by a given reagent results in the survival or death of the host 
cell. More specifically, positive genetic selection systems which utilize the M. xenopi GyrA intein or M. tuberculosis DnaB helicase 
intein are provided. Similar reporter systems utilizing native or homologous exteins and systems utilizing controllable inteins are 
provided, as are methods of controlling in vivo expression of proteins by modulating protein splicing with inhibiting or activating 
agents, and methods of controlling the delivery of proteinaceous drugs in vivo by modulating protein splicing. 
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SCREENING AND USE OF AGENTS WHICH BLOCK 
OR ACTIVATE INTEIN SPLICING UTILIZING 
NATURAL OR HOMOLOGOUS EXTEINS 

RELATFn APPIip AT|0N ft 

This Application is a Continuation-ln-Part of U.S. Pat. No. 
5,834,247, issued November 10, 1998, the disclosure of which 
is hereby incorporated by reference herein. 

BACKfiRmiNn of THE INVENTION 

The present invention relates to the screening for and use 
of agents which either inhibit or activate protein splicing of 
inteins (IVPS). Specifically, disclosed herein is the development 
of 2 specific reporter systems for Gyrase A and DnaB inteins. 
Agents screened for in accordance with the present invention 
can be used to control protein splicing for any purpose, in vivo or 
in vitro, including antimicrobial activity of organisms containing 
inteins in essential genes. More specifically, the present 
invention relates to the use of inteins expressed in modified or 
unmodified native protein splicing precursors or homologous 
extein systems to screen for mutations that modulate splicing 
or agents that inhibit or activate splicing. The present invention 
improves on current reporter systems used to screen for 
agents that can control splicing by using a modified or 
unmodified native precursor or precursor homolog in order to 
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take advantage of the more native intein active site formed by 
natural precursors or inteins in homologous exteins, since 
agents that are derived from non-native precursors may not 
have the identical selected activity on native precursors. 

Production of mature proteins involves the flow of 
information from DNA to RNA to protein. Precise excision of 
DNA and RNA elements which interrupt that information has 
been previously described (M. Belfort, Annu. Rev. Genet. 24:363 

(1990) ; T.R. Cech, Annu. Rev. Biochem. 59:543 (1990); Hunter 
et al., Genes Dev. 3:2101 (1989)). More recently, evidence for 
the precise excision of intervening protein sequences has also 
been described for the TFPI allele from Saccharomyces 
cerevisiae (Hirata et al., J. Biol. Chem. 265:6726 (1990); Kane 
et al., Science 250:651 (1990)) and the recA gene from 
Mycobacterium tuberculosis (Davis et al., J. Bad. 173:5653 

(1991) ; Davis et al., Cell 71:1 (1992)). Each of these genes 
contains internal in-frame peptide segments which must be 
removed to produce the mature protein. Expression of Tfp1 and 
RecA each results in two peptides: one representing the 
intervening protein sequence (IVPS) and the other the ligated 
product of the external protein sequences (EPS). In 1994, the 
terms "intein" and "extern" were adopted in place of IVPS and 
EPS, respectively (Perler, et al., Nucleic Acids Res. 22:1125- 
1127 (1994)). This post-translational processing event has been 
termed "protein splicing". Similarly, the "Vent"® DNA polymerase 
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gene from the hyperthermophilic archaeon Thermococcus 
litoralis contains two in-frame IVPS (Perler, et al., PNAS 
89:5577 (1992)) and the DNA polymerase gene from the 
hyperthermophilic archaeon Pyrococcus species GB-D contains 
one intein (Xu, M., et al., Cell 75, 1371-1377 (1993))? 

Over 80 inteins have been identified in bacteria, archaea 
and eucarya (Perler, F. B., et al. Nucleic Acids Res 25, 1087-93 
(1997), Dalgaard, J. Z., et al., J Comput Biol 4, 193-214 (1997), 
Pietrokovski, S., Protein Sci. 7, 64-71 (1998) and Perler, F. B. 
Nucleic Acids Res. 27, 346-47 (1999). Four inteins have been 
found in Mycobacterium leprae (Davis, E. O., et al., EMBO J. 13, 
699-703 (1994) and Smith, D. R., and et al. Genome Res 7, 802- 
19 (1997)) and three inteins in Mycobacterium tuberculosis 
(Cole, S. T., et al. . Nature 393, 537-44 (1998)). One intein has 
been found in Candida tropicalis (Gu, et al., J. Biol. Chem., 
268(1 0):7372-7381 (1993)). 



Controllable IVPS (CIVPS) and methods for using the same 
to modify, produce and purify target proteins has been 
described (Comb et al., U.S. Patent No. 5,496,714, issued Mar. 
5, 1996; Comb et al., U.S. Patent No. 5,834,247, issued Nov. 10, 
1 998). Methods for using inteins to screen for peptides (or 
derivative, analogic or mimetic thereof) or any agent that can 
enter cells to block or activate splicing of a natural or 
experimental reporter protein have also been described (U.S. 
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Patent No. 5,834,247, supra., at Example 17). These methods 
specifically describe the screening of peptides using 
mycobacterial inteins as targets. The preparation of an in vivo 
peptide library utilizing chicken cc-spectrin is also described. 

While a general method of screening for antimicrobial 
agents using the M. tuberculosis RecA intein in a thymidylate 
synthetase (TS) reporter system has been described (Belfort, 
U.S. Patent No. 5,795,731, issued Aug. 18, 1998), this system 
suffers from several limitations. Importantly, several studies of 
protein splicing in foreign contexts (such as the Belfort system) 
indicate that intein splicing is more efficient in the native extein 
than in foreign exteins (Xu, EMBO J. 13:5517-5522 (1994), Xu, 
EMBO J. 15:5146-5153 (1996), Telenti, J. Bacterid. 179:6379- 
6382 (1997), Chong J, Biol. Chem, 273:10567-10577 (1998), 
Liu, FEBS Lett. 408:311-314 (1997), Wu, Biochim. Biophys. Acta 
1387:422-432 (1998B), Nogami Genetics, 147:73-85 (1997), 
Kawasaki J. BioL Chem., 272:15668-15674 (1997), Derbyshire, 
Proc. Natl. Acad. Sci USA, 94:11466-11471 (1997), 
Southworth, BioTechniques 27:110-121 (1999), Figure 7)). For 
example, the use of foreign exteins yields temperature- 
dependent splicing of the Psp-GBD Pol, Mxe GyrA and 
Synechocystis DnaB inteins (Xu, EMBO J. 13:5517-5522 (1994), 
Xu, EMBO J. 15:5146-5153 (1996), Telenti, J. Bacteriol. 
179:6379-6382 (1997), Chong J. Biol. Chem, 273:10567-10577 
(1998), Liu, FEBS Lett. 408:311-314 (1997), Wu, Biochim. 
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Biophys. Acta 1387:422-432 (1998B), Nogami Genetics, 
147:73-85 (1997), Kawasaki J. Biol. Chem., 272:15668-15674 
(1997) and Southworth, BioTechniques, 27:110-121 (1999), and 
Figure 7). 



While not wishing to be bound by theory, it is believed that 
such inefficient protein splicing in the foreign extein context 
occurs because the flanking extein is, in effect, the substrate 
of the intein. It is, therefore, likely that the intein may exhibit 
substrate specificity like all other enzymes. The substrate 
specificity of the intein limits acceptable extein sequences, 
hence the native extein sequence is the optimal substrate, 
whereas foreign extein sequences may not be acceptable 
substrates at all. For example, studies of the See VMA and Mxe 
GyrA inteins indicate that thiol induced N-terminal splice junction 
cleavage and splicing are, to varying extents, dependent on the 
single extein residue preceding the intein (Chong, J. Biochem. 
273:10567-10577 (1998), Southworth, BioTechniques, 27:110- 
121 (1999)). Other extein residues have also been shown to 
affect splicing of the See VMA intein (Nogami Genetics, 147:73- 
85 (1997), Kawasaki J. Biol. Chem., 272:15668-15674 (1997)). 

Additionally, exteins may affect the packing at the intein 
active site, or global folding of the intein and/or precursor, 
hence the use of a foreign extein may result in improper folding 
of the intein or precursor and inefficient or no splicing. 
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Moreover, expression of an extein gene that naturally contains 
an intein in a foreign host, for example E. coli or yeast, may not 
be efficient (Perler et al. Proc. NatL Acad. Sci. USA 89:5577- 
5581 (1992) and Hodges, et al., Nucleic Acid Res. 20:6153-6157 
(1992)), whereas expression of the homologous endogenous 
extein is likely to be more efficient. For example, the 
Mycobacterium leprae RecA intein fails to splice in E. coli, while it 
splices in M. leprae (Davis, et al., EMBO J., 13:699-703 (1994)). 
It is possible that the M. leprae RecA intein would splice in E. coli 
RecA, although that has yet to be tested. In another example, 
the Synechocystis sp. strain PCC6803 DnaB gene, containing an 
intein, was unclonable in E. coli ( Wu, et al., Proc. Natl. Acad. Sci. 
USA 95:9226-9231 (1998)). The M. leprae GyrA precursor did 
not splice efficiently in E. coli and was mostly insoluble, while the 
homologous Mxe GyrA intein spliced efficiently in E. coli GyrA. 

* i * 

Additionally, the use of homologous exteins would 
eliminate, in many instances, the need to introduce silent 
mutations in the reporter gene in order to insert the desired 
intein (see Belfort, supra., Comb, supra, Example 17). 

T 

Homologous exteins may have naturally-occuring, conserved 
restriction enzyme sites that would allow the cloning of the 
intein into the homologous extein or they may have enough 
extein similarity to allow insertion of the intein into the 
homologous extein by recombination. Such systems also 
eliminate the need for an exogenous reporter gene, since innate 
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extefn properties of the native extein may be used for selection. 
Alternatively, the native extein may be mutated, either de novo 
or based on mutations in similar extein genes, to make the 
extein into a selectable marker or reporter gene. 

Accordingly, the most desirable intein splicing systems 
would be those systems which express an intein from one 
organism in the homologous extein from the foreign host 
organism used for expression or to express the native 
precursor gene in a suitable foreign host organism. 

Such intein systems would not only be useful in the 
screening of antimicrobial agents which inhibit intein splicing 
within a reporter gene (as described in Belfort, supra, Comb, 
supra..), but as controllable targets to direct expression of an 
extein product. Agents, for example peptides, that block intein 
splicing may be used to limit the expression of an extein in such 
systems. The suppression of such expression may be highly 
useful in the drug delivery context, where, for example, one 
wishes to turn on an enzyme which is active in killing cancer 
cells, or by delivering needed activity, for example insulin. 

Similarly, such intein systems may utilize splicing- 
incompetent inteins to screen for agents with the ability to 
activate splicing. 
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SUMMARY OF THE INVENTION 

In accordance with the present invention, there is provided 
selection systems and methods for selecting or screening for 
mutations or agents that control the splicing of inteins which 
comprise use of the intein's native host protein (extein) or a 
homologous extein in any host organism (Figure 8). Specifically, 
in one preferred embodiment, there is provided a positive 
genetic selection system for the screening of agents which 
inhibit protein splicing which comprises: a host cell which 
contains (1) a copy of the extein gene (either episomal, 
chromosomal or synthetic) gene encoding a mutant or naturally 
drug-resistant form of a target enzyme (which as used herein 
includes not only enzymes, but proteins, peptides or the like), 
- and (2) a wild-type or mutant form of the extein gene (either 
episomal, chromosomal, or synthetic) encoding a drug-sensitive 
form of the target enzyme which is dominantly cytotoxic upon 
interaction with the drug, wherein the gene encoding the drug- 
sensitive form of the target enzyme contains an intein, and 
wherein the inhibition of splicing of the drug-sensitive form of 
the target enzyme by a given reagent results in the survival of 
the host cell in the presence of the drug. In one specific 
embodiment, a positive genetic selection system which utilizes 
the M. xenopi GyrA intein is provided. This system is also 
applicable to any GyrA intein inserted at the same or different 
site in the GyrA extein gene. 
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In accordance with another preferred embodiment, there 
is provided a similar positive genetic selection system for the 
screening of agents which inhibit protein splicing which 
comprisesa host cell which contains (1) a copy of the extein 
gene (either episomal, chromosomal or synthetic) encoding a wild 
type form of a target enzyme, and (2) a gene encoding a 
dominant cytotoxic form of the target enzyme (either episomal, 
chromosomal or synthetic) wherein the gene encoding the 
dominantly and cytotoxic form of the target enzyme contains an 
intein, and wherein the inhibition of splicing of the cytotoxic 
form of the target enzyme by a given reagent results in the 
survival of the host cell. In one particularly preferred 
embodiment, a positive genetic selection system which utilizes 
the M. tuberculosis DnaB helicase intein is provided. This 
positive selection system may also employ any DnaB intein 
inserted at the same or different site in the DnaB extein gene. 
Similar systems and methods of screening for agents that 
activate protein splicing are also provided, as are reporter 
systems utilizing native or homologous exteins and systems 
utilizing inteins. 

Also provided by the present invention are methods of 
controlling in vivo expression of proteins by modulating protein 
splicing with inhibiting or activating agents. Similar methods of 
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controlling the delivery of proteinaceous drugs in vivo by 
modulating protein splicing are also provided. 

As used herein, "agent" includes, but is not limited to, a 
peptide (free or displayed on a scaffold such as chicken tx- 
spectrin), a< peptide derivative, analogic or mimetic, a natural 
product or a synthetic molecule. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagram depicting a protein splicing precursor 
and products and alternative names for each element or part 
thereof. 

Figure 2 is a diagram depicting a general scheme for the 
selection of peptides which block intein splicing of a dominant 
lethal suicide gene in vivo. 

Figure 3A is a diagram depicting the irreversible blocking 
of DNA replication by E. coli GyrA interaction with a drug 
(ofloxacin). 

Figure 3B is a diagram depicting a Mxe GyrA intein-splicing 
system for the selection of agents which block intein splicing. 
The splicing of the Mxe GyrA intein out of the homologous Eco 
GyrA extein produces a drug sensitive, wild-type Eco GyrA which, 
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in the presence of ofloxacin, forms an irreversible covalent 
poison complex during replication that kills the cell, despite the 
presence of the chromosomal mutant GyrA which is drug 
resistant. 

Figure 3C is a diagram depicting a Mxe GyrA intein-splicing 
system for the selection of agents which block intein splicing: 
the blocking of splicing of the Mxe GyrA intein out of the 
homologous Eco GyrA extein results in the expression of the 
inactive drug sensitive Eco GyrA and the chromosomal mutant 
GyrA, which is ofloxacin-resistant. Hence, in the absence of an 
active drug sensitive Eco GyrA (due to the blockage of splicing,) 
the host grows. 

Figure 3D is an amino acid sequence comparison of part of 
the E. coli GyrA (SEQ ID NO:1) and M. xenopi GyrA (SEQ ID NO:2) 
sequences, indicating that the GyrA exteins are very similar in 
amino acid sequence, especially at the intein insertion site 
marked by the arrow. 

Figure 3E is a gel indicating efficient splicing of the Mxe 
GyrA intein in the homologous Eco GyrA extein. The position of 
Eco GyrA is indicated by the solid black box and the position of 
the precursor comprising the Mxe GyrA intein in Eco GyrA is 
indicated by the black and white boxed marker with the white box 
indicating presence of the intein. 
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Figure 4A-1 is a diagram depicting the intersection of 
DnaB with DnaC which is required to load DnaB onto the DNA 
replication machinery. 

Figure 4A-2 is a diagram depicting the sequestration of 
DnaC by a mutant E. coli DnaB helicase which leads to disrupted 
DNA replication and cell death. 

Figure 4B is a diagram depicting a Mtu DnaB helicase 
intein-splicing system for the selection of agents which block 
intein splicing: the splicing of the Mtu DnaB helicase intein out of 
a dominant lethal mutant Mtu DnaB helicase extein produces 
mutant Mtu DnaB helicase which sequesters Eco DnaC and 
poisons replication despite the presence of the chromosomal Eco 
DnaB helicase, as a result, the host dies. 

Figure 4C is a diagram depicting a Mtu DnaB helicase intein- 
splicing system for the selection of agents which block intein 
splicing: the blocking of splicing of Mtu DnaB helicase intein out 
of a dominant lethal mutant Mtu DnaB helicase extein prevents 
the sequestration of Eco DnaC; chromosomal Eco DnaB helicase 
is expressed and the host grows. 

Figure 4D is an amino acid sequence comparison of part of 
the E. coli DnaB helicase (SEQ ID NO:3) and M. tuberculosis DnaB 
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helicase (SEQ ID NO:4) sequences indicating that the amino acid 
sequences are very similar and that the site in E. coli DnaB that 
was mutated to make it cytotoxic is conserved in M. 
tuberculosis DnaB (first on larged sequence) and that the intein 
insertion site is also conserved in E. coli DnaB (marked by the 
arrow). 

Figure 5A is a diagram depicting the production of a 
combinatorial peptide library using chicken a-spectrin and the 
screening of these peptides for those that block Mxe GyrA 
helicase intein splicing, "aa" represents a portion of spectrin 
which can be randomized. The spectrin scaffold is represented 
by a trapazoid and the different amino acid sequences by 
various other shapes. If the spectrin binds to the precursor, 
splicing is blocked. The system has three components: a host 
cell expressing T7 RNA polymerase, the spectrin library and the 
intein plus GyrA gene. The latter two genes are present on a 
single plasmid under control of T7 RNA polymerase promoters. 

Figure 5B-1 is a flow diagram indicating multiple-round 
selection of combinatorial peptides that block Mxe GyrA. 

Figure 5B-2 is a flow diagram indicating Mtu DnaB helicase 
intein splicing. 
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Figure 5C is a gel indicating that peptides p814-818 (SEQ 
ID N0:5, SEQ ID NO:6, SEQ ID N0:7, SEQ ID NO:8, SEQ ID NO:9 and 
SEQ ID NO:10) block splicing of Mtu DnaB in E.coli. This is a 
Western block using anti-T7 tag antibody to detect the T7 tag 
at the N-terminus of each DnaB protein. In p815rev t the 
selected blocking peptide sequence in a-spectrin has been 
replaced with the wild type spectrin sequence to demonstrate 
that inhibition of splicing is due to the selected peptide 
sequence. The bands and markers on the right represent the 
precursor, a putative C-terminal cleavage product and the 
spliced DnaB exteins, respectively from top to bottom of the 
Western Blot. Size markers are in lane M. 

Figure 6 is a diagram depicting the production of a 
combinatorial peptide library using chicken a-spectrin and the 

screening of these peptides for those that block Mtu DnaB intein 
splicing in Mtu DnaB. E. coli GyrA gyrase. M aa M represents a 
portion of spectrin which can be randomized- The spectrin 
scaffold is represented by a trapazoid and the different amino 
acid sequences by various other shapes. If the spectrin binds to 
the precursor, splicing is blocked. The system has three 
components: a host cell expressing T7 RNA polymerase, the 
spectrin library and the intein plus GyrA gene. The latter two 
genes are present on a single plasmid under control of T7 RNA 
polymerase promoters. 
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Figure 7 is a table showing the effect of the single amino 
acid preceding the Mxe GyrA intein in a heterologous extein 
context on splicing and N-terminal cleavage by DTT. 

Figure 8 is a flow chart for choosing native precursors, 
homologous exteins or heterologous exteins to develop a 
selection or reporter system for testing agents that inhibit or 
activate splicing of an intein. 

Figure 9 is a summary of the various methods of selecting 
agents that inhibit or activate protein splicing. Each system is 
based on a merodiploid cell containing an intein plus and an intein 
minus extein gene. 

Figure 10 depicts the scheme for creating random 
mutations in the Mxe gyrA intein by error prone PCR of the 
intein followed by cloning of the mutated intein genes into the 
E.coli Mxe gyrA extein. 



Figure 11 depicts the selection scheme based on the GyrA 
selection described in Example I in which the presence of a drug 
kills cells where the intein has spliced. Clones that do not splice 
at Temperature 1 grow, while replica plated clones that splice at 
lower Temperature 2 do not grow. 
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Figure 12 show cell lysates from wild type or mutated 
intein clones were electrophoresed in SDS acrylamide gels. A 
temperature sensitive clone grown at 37°C (labeled 'A') fails to 
splice, while the wild type intein clone splices (labeled 'WT'). 
Wild-type levels of splicing are observed in the mutant clones (V 
6) when shifted to 16°C overnight. 

Figure 13 illustrates the Mxe GyrA intein sequence (SEQ ID 
NO:46) with mutations found in the temperature sensitive 
splicing clones indicated below the wild-type residue. 

Figure 14 illustrates the positioning of the mutations in 
the temperature sensitive splicing clones on the Mxe GyrA intein 
3-D structure. The two panels depict opposite sides of the Mxe 
GyrA intein with a single Alanine preceding the intein. Double 
amino acid change indicates that the clone had more than one 
mutation. 

DETAILED DESCRIPTION 

The present invention is directed to methods of selecting 
or screening for mutations or agents that block or activate 
protein splicing of inteins using natural precursors or by 
inserting inteins in homologous extein genes. These mutations 
or agents can be used to activate or keep inactive enzymes in 
vivo or in vitro for pharmacological, chemotherapeutic, or 
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biotechnologfcal purposes. In contrast, these same methods can 
be used to select agents that block or activate splicing in a non- 
homologous extein if no genetic selection system or screen can 
be generated for the native extein protein. 

The in vivo control of protein splicing mediated by a 
blocking or activating peptide, or other agent that can enter a 
cell, acting on a controllable intervening protein sequence 
(CIVPS) has been described (U.S. 5,834,247, supra, at Example 
17). In the present invention, It should be noted that a non- 
controllable IVPS, or intein, is used to identify agents that will 
convert the IVPS into a CIVPS. The blocking of such splicing 
activity by specific agents such as peptides or natural products, 
and analogues thereof, is particularly useful in combating 
pathogens such as Mycobacterium tuberculosis, Mycobacterium 
leprae, Mycobacterium avium, or Candida tropicaiis by blocking 
the splicing of essential proteins in those organisms. 

Approximately 97 inteins have been identified and are 
available from public databases (Perler, Nucleic Acids Res. 
22:1125-1127 (1994), Perler, Nucleic Acids Res. 27:346-347 
(1999), Pietrokovski, S., Protein Sci., 7:64-71 (1998) and 
Dalgaard, et al., J. Comput. Biol., 4:193-214 (1997). Sequencing 
projects of small prokaryotic genomes (e.g., Mycobacterium 
tuberculosis, Mycobacterium leprae, and Methanococcus 
jannaschii) already account for the majority of published intein 
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sequences. Host genes of these inteins are often involved in 
such essential cellular functions as DNA replication, expression, 
or various metabolic processes (compiled in: Perler, Nucleic 
Acids Res. 25:1087-1093 (1997), Perler, Nucleic Acids Res. 
27:346-347 (1999), Pietrokovski, S., Protein Sci., 7:64-71 
(1998) and Dalgaard, et aL, J. Comput. Biol., 4:193-214 (1997). 
Hence, the disruption of these essential functions via the 
blocking of intein splicing by peptides, or other agents, 
represents a means by which to screen for anti-microbial and 
anti-pathogenic agents. 

Generally, a positive selection system consists of a gene 
that is detrimental to a host organism depending on the growth 
media or the host strain genetic background. The gene product 
is toxic for the cell, inhibiting growth or killing the host unless 
the gene product is inactivated. In the context of a protein 
splicing genetic system, a positive selection system is defined 
as a system that allows selection against the splicing of an 
intein inserted in-frame into a host gene (see Figure 2). If 
splicing occurs in the precursor protein containing the intein, the 
cytotoxic host protein will be active and inhibit cell growth or kill 
the cell; if splicing is disrupted the cytotoxic host protein will be 
inactive and allow cell growth. The same description applies to 
reporter systems where detection of the host protein is scored, 
rather than selection for organism viability. Many reporter genes 
are known, the most common example is the Blue/white screen 
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involving G-galactosidase function on X-gal to produce a blue 
color. 

In accordance with one embodiment of the present 
invention, there is provided a positive selection system for 
identifying agents which block or activate protein splicing which 
comprises a host cell which contains (1) a copy of the extein 
gene (either episomal, chromosomal or synthetic) encoding a 
mutant or naturally drug-resistant form of a target enzyme; 
and (2) a wild type or mutant form of the extein gene (either 
episomal, chromosomal or synthetic) encoding the target 
enzyme that is sensitive to the drug, into which is inserted an 
intein, wherein the spliced form of the intein-containing target 
enzyme is toxic to the host organism upon interaction with a 
certain drug. In this system, the host cell so transformed will 
express the drug sensitive enzyme if the intein is properly 
spliced, resulting in reduced viability of the organism because 
the spliced product is dominantly lethal or cytotoxic to the host 
organism, despite the fact that the drug-resistant homologous 
gene is also expressed. The intein-les copy of the gene is 
required to maintain viability in cells in which splicing of the 
plasmid borne extein gene is blocked. 

In one preferred embodiment, a plasmid-encoded, drug 
sensitive gene is the naturally occurring intein/extein precursor 
or its homolog containing an intein, which intein may be either 
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naturally occurring or inserted, and drug sensitivity may be 
naturally occurring in this precursor gene or introduced by 
mutation in the extein portion of the gene. This system results 
in death of all cells where splicing occurs and thus provides a 
system for selecting for mutations, drugs, chemicals, peptides, 
etc. which block splicing in vivo since cell viability requires 
blockage of splicing. 

In a particularly preferred embodiment, the Mycobacterium 
xenopi GyrA intein (Mxe GyrA) (SEQ ID NO:11) (Telenti ef a/., J. 
Bacteriol. 179:6378-6382 (1997) is inserted into the 
homologous extein of E. coli GyrA (see Figure 3D). In E. coli, 
GyrA is an essential gene that encodes for the A subunit of the 
E. coli gyrase hetero-tetramer protein complex. E. coli gyrase is 
a type II topoisomerase involved in DNA relaxation at the origin 
of replication of the bacterial chromosomal DNA (Swanberg and 
Wang, J. MoL Biol. 197:729-736 (1987)). The wild type E. coli 
GyrA binds irreversibly to quinoline drugs such as ofloaxcin, 
preventing DNA relaxation during replication, and leading to cell 
death (see Figure 3A). However, certain mutants of wild type E. 
coli GyrA are drug-resistant, while retaining gyrase activity. The 
generation of an E. coli GyrA merodiploid host cell which contains 
a chromosomal copy of a drug-resistant gyrA gene with a 
second intein-containing, drug sensitive gyrA gene results in a 
drug sensitive E.coli host, since in this case, drug sensitivity is 
dominant. The drug sensitive phenotype is dominant because the 
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drug sensitive GyrA forms an irreversible covalent poison 
complex with the drug that interferes with DNA replication 
(Figure 3A). 

By merodiploid we mean that the cell contains an extra 
copy of a gene (or several genes) which has been introduced into 
the cell by any means known to one skilled in the art, such as 
transformation, infection, conjugation, plasmids, virus, phage, or 
by generating a transgenic strain and which may be present on 
either an episomal element or on the host chromosome. 

In accordance with the present invention, there is further 
provided a similar type of positive selection system (for 
identifying agents which block or activate protein splicing) which 
comprises a host cell which contains (1) a copy of the extein 
gene (either episomal, chromosomal or synthetic) encoding a wild 
type form of a target enzyme, which expresses a non-toxic 
form of the extein protein; and (2) a second a extein gene 
(either episomal, chromosomal or synthetic) encoding a 
cytotoxic form of the target enzyme , into which is inserted an 
intein. In this system, the merodiploid host cell expresses the 
cytotoxic enzyme if the intein is properly spliced. Thus, cells 
must be treated with chemicals, agents or peptides that block 
splicing of the cytotoxic enzyme at all times when the cytotoxic 
enzyme is expressed. The cytotoxic extein must be dominantly 
lethal, as the intein-less copy of the extein gene is also 
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expressed. The intein-less copy of the gene is required to 
maintain viability in cells in which splicing of the plasmid borne 
extein gene is blocked. 

In one preferred embodiment, instead of using an intein 
inserted Jnto a cytotoxic foreign extein homolog, the natural 
intein precursor may be mutated to produce a cytotoxic extein 
enzyme after splicing of the intein. 

In a particularly preferred embodiment, the Mycobacterium 
tuberculosis DnaB precursor (Cole et a/., Nature, 393:537-544 
(1998)) is mutated in the extein region to a cytotoxic form 
based on the known cytotoxic mutation in E.coli DnaB, where 
Arg231 was mutated to Cysteine (Marszalek and Kaguni, J. Biol. 
Chem., 267:19334-19340 (1992) and Shrimankar, et al., J. 
Bacterid., 174:7689-7696 (1992)). DNA helicases are essential 
proteins that unwind a DNA duplex to yield a single-stranded DNA 
intermediate required for replication, recombination, and repair 
(LeBowitz and McMacken, J. Biol. Chem., 261:4738-4748 (1986) 
and Lohman, MoL, Microbiol., 6:5-14 (1992)). The hexameric E. 
coli helicase encoded by the dnaB gene interacts with an 
hexameric DnaC complex and ATP. Some DnaB mutants are 
dominant lethal (Bouvier and Oreglia, C.R. Acad. Sci. Hebd 
Seances Acad. Sci. D., 280:649-652 (1975) and Maurer and 
Wong, J. Bacteriol., 170:3682-3688 (1988), Saluja and Godson, 
J. Bacteriol., 177:1104-1111 (1995) and Sclafani, et al., MoL 
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Gen. Genet. 182:112-118 (1981)). The R231C mutant protein is 
deficient in ATP hydrolysis, helicase activity, and replication 
activity at the chromosomal origin of replication resulting in cell 
death. As shown in Figure 4D, Mtu DnaB contains this same 
Arginine (R227), and mutating it to Cysteine renders the Mtu 
DnaB gene cytotoxic. This mutation is dominantly cytotoxic in E. 
coli, and both the E.coli and M. tuberculosis DnaB proteins result 
in sequestering of the E. coli DnaC protein into inactive 
complexes, preventing DnaC from 'loading' DnaB onto the E. coli 
DNA replication fork (see Figure 4A-1 and Figure 4A-2). 

Both of the positive selection systems described in the 
present invention utilize either native or homologous foreign 
exteins in order to optimize protein splicing and avoid the 
ineffecient splicing which can result from insertion of the intein 
into non-homologous foreign extein. 

Co-transformation of the host cell in these positive 
selection systems with a plasmid engineered for the expression 
of an in vivo peptide library or transformation with a plasmid 
that contains both the selection marker and the in vivo peptide 
library (as in Figure 6) allows for the direct selection of clones 
expressing a peptide that blocks (or alternatively activates) 
protein splicing. In vivo expression of peptides may be hampered 
by the host's efficient proteolytic degradation systems. 
Therefore, expression of these peptides in vivo in the context of 
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larger proteins is preferred, especially in surface loop regions of 
larger proteins. In vivo expression of peptides fused to larger 
proteins has been achieved for example, in the catalytic loop of 
thioredoxin (Colas et al. f Nature, 380:548-550 (1996)), and it is 
possible to express peptides fused within many different 
proteins.* Peptides expressed in-frame in highly soluble, well 
expressed, thermostable, solvent-exposed loops of a protein are 
less subject to in vivo proteolysis or degradation and such 
fusions enhance the functional expression of peptides in a cell. 

In a preferred embodiment, a combinatorial peptide library 
in a fragment of chicken a-spectrin is constructed, as 

previously described (see U.S. 5,834,247, supra at Example 17). 
The EF hand region of chicken a-spectrin was chosen because its 
structure is known, its EF hand domain forms a small protein 
with a stable structure, and it has a flexible surface loop. The 
structure of the chicken a-spectrin EF hand domain was 

elucidated by NMR analysis (Trave, et aL, EMBO J. 14:4922-4931 
(1995)) . The term EF hand describes a type of protein tertiary 
structural motif consisting of a helix, a turn (loop) and a second 
helix. The EF hand domain of chicken a-spectrin is located at the 
carboxy terminus of chicken a-spectrin. Its 84 amino acid 
structure is arranged in two EF hand helix-turn-helix motifs 
separated by a 14 amino acid long flexible linker (SEQ ID NO:12). 
The protein is extremely soluble without any detectable 
precipitation or aggregation even at concentrations of up to 10 
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mM. The linker loop is mainly unstructured in solution and 
mutagenesis data show that minor deletions or insertions in the 
loop do not disturb the stabilizing hydrophobic interactions 
between the 2 EF-hand. 

We have taken advantage of this last property to insert 
random peptides in the linker region between the chicken oc- 
spectrin EF hands. Peptide libraries of various sizes can be 
investigated. It will be readily apparent to the skilled artisan 
that alternative methods of producing in vivo peptide libraries 
for screening may be utilized and are within the scope of the 
present invention. 

Although the systems discussed above select for agents 
that block splicing of native or homologous exteins, it will be 
recognized by those of skill in the art that similar strategies can 
be used for screening with reporter genes to look for agents 
that inhibit expression of active reporter genes. It will likewise 
be recognized by the skilled artisan that similar strategies can 
be used to look for agents that activate splicing of a splicing- 
deficient intein in its native context or in a homologous extein 
gene, as long as the extein gene can be converted into a 
reporter. For example, in one embodiment, a reagent that 
activates a splicing-deficient precursor results in expression of 
an active reporter protein, resulting in inhibition of cell growth 
or detection of the active reporter protein. 
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Although the Escherichia coli (E. coli) GyrA and the 
Mycobacterium tuberculosis (M. tuberculosis) DnaB selection 
systems are described above, it will be recognized by the skilled 
artisan that any genetic selection system can also be used to 
isolate peptide sequences or other agents which disrupt or 
catalyze protein splicing. Likewise, although we describe the 
specific use of the M. xenop/ GyrA intein (SEQ ID NO:11) and the 
M. tuberculosis DnaB intein (SEQ ID NO:13), the skilled artisan will 
recognize that this strategy is equally applicable to any intein 
(see, e.g., Perler, et al., Nucleic Acids Res. 27:346-347 1999)) 
present in its native or homologous context. It will likewise be 
readily apparent to those of skill in the art that alternative 
means of generating peptide libraries for screening may be used. 
It will likewise be recognized by the skilled artisan that in the 
absence of a selection or screening system for the native extein 
protein that similar strategies can be applied to splicing of 
inteins in less optimal heterologous extein systems. 

Activating and/or inhibiting agents identified by the 
screening methods of the instant invention may also be used to 
control the in vivo expression of a target protein. Once the only 
copy of an active extein gene contains an intein, gene function 
can be inhibited if the organism is treated with an agent that 
blocks splicing. On the other hand, if a splicing-impaired intein is 
used, gene function can be activated if the organism is treated 
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with an agent that activates splicing. The agents and splicing 
can be modulated at any time during the development and life of 
the organism by addition or removal of the splicing activating or 
inhibiting agent. 

Similarly, controllable splicing may be used to deliver 
active proteins at specific times or to specific places. In many 
instances, therapeutic drugs can be cytotoxic to the host and 
would be best if only active at the target site. For example, 
chemotherapy drugs are often generally cytotoxic and adverse 
reactions in normal cells could be eliminated if the drug could be 
specifically activated in the tumor. If one has a drug that is at 
least partially proteinacious, an intein that can be activated or 
inhibited by a second agent, as described above, could be 
inserted into the protein portion of the therapeutic agent. The 
drug is then administered in an inactive form, and subsequently 
activated in the desired target tissue. 

As noted above, it is believed that inefficient protein 
splicing in the foreign extein context occurs as inteins may 
exhibit substrate specificity, preferring their native extein 
sequence to that of foreign exteins. In accordance with another 
embodiment, there is provided a method of overcoming this 
limitation by employing an intein with one or more, preferably 
one to five, amino acid residues from its native extein. Inclusion 
of such amino acid residues may be at either or both ends of the 
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homologous intein. Inclusion of amino acid residues from the 
native extein will facilitate methodologies of the present 
invention. Such amino acid residues from the native extein may 
be incorporated into the precursor by methods well known to 
those skilled in the art. 

Insertion of a target intein in the heterologous extein may 
be at any of a number of sites, including but not limited to, a 
surface location in the extein, within a loop region of the extein, 
at a protease sensitive site, or a position known to facilitate 
insertion of additional amino acid residues without inactivating 
the extein. 

The Examples presented below are only intended as 
specific preferred embodiments of the present invention and are 
not intended to limit the scope of the invention except as 
provided in the claims herein. The present invention 
encompasses modifications and variations of the methods 
taught herein which would be obvious to one of ordinary skill in 
the art. 

The references cited above and below are hereby 
incorporated by reference herein. • 
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EXAMPLE I 

A Mxe GyrA Intein-Mediated Positive Selection System 

for Inhibition of Protein Splicing 

A Positive? Seleotion System Based on Blo cking Splioing of tha 
Mxft GvrA Intein (IVPS1 in F noli fiyrA- RnnkgrmmH 

Gyrases are essential multimeric enzymes involved in DNA 
replication in bacteria (Swanberg and Wang, 1987). Both gyrase 
subunit A and B have been extensively studied as drug targets in 
bacterial human pathogens (e.g., Mycobacteria, Salmonella, 
Enterbacteriaceae, Citrobacter, Pseudomonas, Streptococcus, 
Staphylococcus, Yersinia, Rhodobacter, Haemophilus, Neisseria, 
Providencia). The GyrA subunit of gyrases can complex with 
quinoline drugs, such as ofloxacin, and induce cell death. The 
complex formation of quinolines with gyrase is followed by a 
rapid and irreversible inhibition of DNA synthesis, inhibition of 
growth, and induction of the SOS response (see Figure 3A). At 
higher drug concentrations, cell death occurs as double-strand 
DNA breaks in the bacterial chromosome are released from 
trapped gyrase complexes. 

In many gram-negative bacteria (e.g., E. coli), resistance 
to quinoline arises from mutation of the Gyrase A protein in the 
quinoline resistance determining region such as gyrA96 or 
gyrA83 . Those mutations may involve Ser83 in E. coli GyrA. In a 
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merodiploid cell containing a drug resistant gyrA gene (such as 
gyrA83) on the chromosome and a wild type (gyrA+) copy of 
gyrA on a plasmid, the wild type gene (drug sensitive) product of 
gyrA is dominant. By merodiploid, we mean that the cell contains 
an extra copy of a gene (or several genes) which has been 
introduced into the cell by any means known to one skilled in the 
art, such as transformation, infection, conjugation, plasmids, 
virus, phage, or by generating a transgenic strain and which may 
be present on either an episomal element or on the host 
chromosome. The wild type gyrA gene can be introduced into 
the cell by any means known to one skilled in the art and should 
not be considered limited to a plasmid. Many E. coli strains are 
available that contain gyrA mutants which are resistant to 
quinoline drugs such as ofloxacin. However, this system is also 
applicable to any other host system where (1) the chromosomal 
copy of the gyrA gene is resistant to quinoline drugs, (2) the 
introduced sensitive gyrA gene is present as the heterologous E. 
coli gyrA::Mxe gyrA intein fusion or the native Mxe gyrA or M. 
leprae gyrA genes, and (3) the intein containing drug sensitive 
gyrA gene is operably linked with the appropriate signals for 
expression in that host. Likewise, the Mxe GyrA intein could be 
inserted into the gyrA gene of any experimental host cell just as 
it was inserted into the E. coli gyrA gene. Likewise, any gyrA 
intein can be used in the above selection system, whether 
present at the same site as the Mxe GyrA intein or a different 
site. 
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Since sensitivity to quinoline drugs is dominantly cytotoxic, 
in the presence of these drugs, a gyrA+/gyrA83 host cell is not 
viable because wild type GyrA proteins can still bind drug 
molecules and poison DNA replication (see Figure 3B). The co- 
expression of a chicken cc-spectrin peptide library (as described 
in U.S. 5,834,247 supra, at Example 17) allows for the positive 
selection of peptides that can disrupt splicing of the Mxe GyrA 
intein. Likewise, this system can be used to screen for any 
agent that inhibits splicing of the Mxe GyrA intein in vivo or for 
Mxe GyrA intein mutations that block splicing. 

Insertion of the Mxe GvrA intein (I V P S) ge n e into tha homoing nng 

site in the F nnli GyrA gene 

In some Mycobacteria, the gyrase A subunit active site is 
often interrupted by a naturally occurring allelic intein (IVPS) 
near the active site tyrosine residue (e.g., Mycobacterium 
flavescens, Mycobacterium gordonae, Mycobacterium kansasii, 
Mycobacterium leprae, Mycobacterium malmoense and 
Mycobacterium xenopi, (Sander, et al., Microbiology, 144:589- 
591 (1998), Telenti, et al., J. Bacterid., 179:6378-6382 (1997), 
Perler, et al., Nucleic Acids Res. 27:346-347 (1999), 
Southworth, BioTechniques, 27:110-121 (1999)). The M. xenopi 
(Mxe) GyrA intein (IVPS) utilized in the system described herein, 
lacks the endonuclease signature motifs and other sequences 
similar to homing endonucleases and has been extensively 
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studied as a prototype minimal intein (IVPS) (Klabunde, et al M 
Nature Struct. Biol. 5:31-36 (1998), and Telenti, et al., J. 
Bacteriol., 179:6378-6382 (1997) and Southworth 
BioTechniques, 27:110-121 (1999)). The most favorable 
insertion site for the Mxe GyrA intein in E. coli GyrA is the 
homologous insertion site compared to the native Mxe GyrA 
extein, since it shares sequence identity with the native Mxe 
GyrA extein (Figure 3D). The intein (IVPS) insertion site was 
chosen immediately upstream of the conserved tyrosine active 
site residue at position 122 in E. coli GyrA: 

PS A A AM R Y122c it-shn T123 EIRLAKI 
(SEQ ID NO:1 4) (SEQ ID NO:45) 

Amino acid numbers refer to the position of the amino 
acids in E. coli GyrA (see SEQ ID NO:1 for a partial E. coli GyrA 
sequence). The underlined amino acids (single letter amino acid 
code) are the amino acids identical in both E. coli and Mxe GyrA 
exteins (see Figure 3D). The lower case letters represent Mxe 
GyrA intein (IVPS) amino acids. The dashes represent the 
remainder of the residues of the Mxe GyrA intein (IVPS) that are 
not listed (see SEQ ID NO:2, for the complete Mxe GyrA intein 
sequence). 

First, the E. coli gyrA gene was cloned by polymerase chain 
reaction (PCR) using E. coli K12 genomic DNA under the following 
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experimental conditions. A forward primer 5'-GATA 
GGCTAGCGATGAGCGACCTTGCGAGAG-3' (SEQ ID NO: 15) and 
reverse primer 5 ' -TG A AG C A ATTG AATTATTCTTCTTCTG G CTCG -3' 
(SEQ ID NO:16) were used in a PCR mixture containing 20 U/ml 
Vent® Exo+ DNA polymerase (New England Biolabs, Inc., Beverly, 
MA), 400 //M of each dNTP, 4 nM each primer and 100 ng of E. 
coli K12 genomic DNA. Amplification was carried out in a Perkin- 
Elmer/Cetus (Emeryville, CA) thermal cycler 480 for 1 min at 
95°C and then cycled at 45°C, 30 sec; 72°C, 2 min and 30 sec; 
95°C, 30 sec for 20 cycles. The PCR products from one 50 jjI 
PCR reaction and 2 yt/g of pCYB1 (IMPACT™ I kit, New England 
Biolabs, Inc., Beverly, MA) were separately digested with 250 
U/ml of Nhel and 1000 U/ml of Mfel in the presence of 100 
fjg/m\ of BSA. The digestion was performed at 37°C for 1 hour. 
Digested PCR products and plasmid DNA were separated by 
agarose gel electrophoresis and the excised bands further 
purified using QIAEX II beads as described by the manufacturer 
(Qiagen, Studio City, CA). Ligation was carried out at 20°C for 1 
hour using a 1:4 ratio of vector to insert and 40,000 U/ml of T4 
ligase. Ligation products were transformed into E. coli strain 
ER2267 competent cells. Recombinant plasmids were checked by 
Nhel/Mfel digestion which results in the excision of the cloned 
insert in properly ligated recombinants. One of the resultant 
correct plasmids containing the E. coli gyrA gene placed under 
transcriptional control of the pCYB1 pTac promoter was named 
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pEA500. The gyrA insert was checked by DNA sequencing to 
insure that no sequence errors were introduced by PCR. 

Second, to facilitate cloning of the Mxe GyrA intein into E. 
coli GyrA, unique silent Notl and Xbal restriction enzyme sites 
were engineered 7 bp and 44 bp, respectively, away from each 
side of the E. coli GyrA active site residue, Y122 of pEASOO by 
site-directed silent mutagenesis. The QuickChange kit was used 
following the manufacturer's instructions (Stratagene, La Jolla, 
CA) with mutagenic primers: Notl oligonucleotides: 5'- 
CGGCGACTCTGCGGCCGCAATGCGTTATA CGG-3' (SEQ ID NO:17) 
and 5'-CCGTATAACGCATTGCGGCCGCA GAGTCGCCG-3' (SEQ ID 
NO:18), and Xbal oligonucleotides: 5 f - 

GAACTGATGGCCGCTCTAGAAAAAGA GACGG-3' (SEQ ID NO:1 9) 
and 5'-CCGTCTC I I I I I CTAGAGCGGCCA TCAGTTC-3' (SEQ ID 
NO:20). The resultant plasmid containing E. coli GyrA with Notl 
and Xbal restriction enzyme sites was called pEA502. 

Third, a 68 bp DNA cassette with flanking Notl/Xbal 
restriction sites was designed to be cloned into the pEA502 
unique Notl/Xbal sites. This cassette introduced a unique Blpl 
silent restriction enzyme site 10 bp away from Y122 which 
subsequently allowed cloning of any intein (IVPS or CIVPS) near 
the E. coli GyrA active site Y 122 using Notl and Blpl restriction 
enzyme sites. This cassette was generated by annealing 2 
complementary oligonucleotides : 5'-GGCCGCAA 
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TGCGTTATACGGAAATCCGCTTAGCGAAAATTGCCCATGAACTGATG 
GCCGAT-3' (SEQ ID NO:21) and 5'-CTAGATCGGCATCAGTTCATG 

■ 

GGCAATTTTCGCTAAGCGGATTTCCGTATAACGCATTGC-3' (SEQ ID 
NO:22). 5 nM of each oligonucleotide was combined in 1X T4 
Hgase buffer (New England Biolabs, Inc., Beverly, MA)' boiled for 
5 min and cooled down to room temperature. 10 jug of pEA502 
were digested with Notl and Xbal using 500 U/ml each enzyme in 
the presence of 100 /jg/m\ of BSA. The digestion was performed 
at 37°C for 2 hours. The digested plasmid DNA was separated by 
agarose gel electrophoresis and the excised band further 
purified using QIAEX II beads as described by the manufacturer 
(Qiagen, Studio City, CA). Ligation of the oligonucleotide 
cassette and the digested plasmid DNA was carried out at 20°C 
for 1 hour using a 1:2 ratio of vector to insert and 40,000 U/ml 
of T4 ligase. Ligation products were transformed into E. coli 
strain ER2267 competent cells. Recombinant plasmids were 
checked by Blpl digestion which results in the linearization of the 
correct recombinant plasmids. One of the resultant correct 
plasmids was named pEA523. 

Fourth, the Mxe GyrA intein (IVPS) was amplified by PCR 
with the addition of primer derived Notl and Blpl sites using 
pMIP(Mxe)#4 plasmid DNA (Telenti et al., J. Bacterid, 179:6378- 
6382 (1997)) under the following experimental conditions: 
Forward primer 5'-CGACCCGCGCGGCCGCAATGC 
GTTATTG CATC ACG GG AG -3' (SEQ ID NO:23) and reverse primer 
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5'<aCCAAAGGCGCTAAGCX3GATTTCCGTGTTGTGGCTGACGAACC 
CG-3' (SEQ ID NO:24) were used in a PCR mixture containing 10 
U/ml Taq DNA polymerase (Promega, Madison, Wl), 200 jjM of 
each dNTP, 4 nM each primer and 100 ng pMIP(Mxe)#4 DNA. 
Amplification was carried out in a Perkin-Elmer/Cetus 
(Emeryville, CA) thermal cycler 480 at 94°C, 30 sec; 50°C, 30 
sec; 72°C, 15 sec for 10 cycles. The PCR products of one 50 //I 
reaction and 2 //g of pEA523 were separately digested using 
1000 U/ml of Notl and 300 U/ml of Blpl in the presence of 100 
//g/ml of BSA. The digestion was performed at 37°C for 2 hours. 
Digested PCR products and plasmid DNA were separated by 
agarose gel electrophoresis and the excised bands further 
purified using QIAEX II beads as described by the manufacturer 
(Qiagen, Studio City, CA). Ligation was carried out at 20°C for 2 
hours using a 1:3 ratio of vector to insert and 40,000 U/ml of 
T4 ligase (New England Biolabs, Inc., Beverly, MA). Ligation 
products were transformed into E. col i strain Top10F' 
(Invitrogen, Carlsbad, CA) competent cells. Recombinant 
plasmids were checked by EcoNI restriction enzyme digestion 
which results in the linearization of the correct recombinant 
plasmids. One of the resultant correct plasmids was named 
pEA600 and contains the in-frame insertion of the Mxe GyrA 
intein (IVPS) into the E. coli GyrA extein at the active site Y122 
(see Figures 3D and 3E). 
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Construction of a vector for co-expression of a peptide lihrary 
and the Mxe GyrA intein (I VPS) -F noli Gyr A selection system 

As described above, the Mxe GyrA intein (IVPS) was 
inserted into the active site of E„ coli GyrA. In this homologous 
context, the Mxe GyrA intein (IVPS) splices efficiently to 
produce active E. coli GyrA. As is detailed below, the E. coli 
gyrA::Mxe gyrA intein (IVPS) gene fusion described above was 
cloned under control of a T7 RNA polymerase promoter and 
introduced into E. coli, ER2726 (New England Biolabs, Inc., 
Beverly, MA). ER2726 expresses 17 RNA polymerase and has 
the gyrA83 mutation which makes the chromosomal gyrA gene 
resistant to quinoline drugs. In the presence of quinoline drugs 
such as ofloxacin, only splicing deficient clones can survive (see 
Figures 3B and 3C), since the spliced gyrA product is sensitive 
to ofloxacin in a dominant cytotoxic manner (see above). 

The spectrin scaffold was cloned into EA600 as follows. 
First, a 30 bp DNA cassette with flanking PflMI/Apal restriction 
sites was designed to be cloned into the unique PflMI/Apal sites 
in pEA600 (which also contains the E. coli gyrA::Mxe gyrA 
fusion). This cassette introduced a unique Sphl site in place of 
the laciq gene and was synthesized by annealing 2 
oligonucleotides: 5'-ATGGGCATGCATATATATA TAGGCCTGGGCC- 
3' (SEQ ID NO:25) and 5*-CAGGCCTATATATAT 
ATGCATGCCCATTCG-3' (SEQ ID NO:26). 5 nM of each 
oligonucleotide was combined in 1X T4 ligase buffer (New England 
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Biolabs, Inc. Beverly, MA), boiled for 5 min and cooled down to 
room temperature. 5 jjq of pEA600 was digested using 320 U/ml 
of PflMI and 800 U/ml of Apal in the presence of 100 //g/ml of 
BSA. The digestion was performed at 37°C for 2 hours. The 
digested plasmid DNA was separated by agarose gel 
electrophoresis and the excised band further purified using 
QIAEX II beads as described by the manufacturer (Qiagen, Studio 
City, CA). Ligation was carried out at 16°C for 1 hour using a 1:1 
ratio of vector to insert and 40,000 U/ml of T4 ligase. Ligation 
products were transformed into E. coli strain XL1B (Stratagene, 
La Jolla, CA) competent cells. Recombinant plasmids were 
checked by Sphl digestion which results in the linearization of 
the correct recombinant plasmids. One of the resultant correct 
plasmids was named pEA661 . 

Second, unique Sgfl and sites Clal were engineered on 
either side of the spectrin loop region in a spectrin encoding 
plasmid (Trave, et al., EMBO J. 14:4922-4931 (1995)) by site- 
directed silent mutagenesis using the QuickChange kit as 
described by the manufacturer (Stratagene, La Jolla, CA). The 
Sgfl oligonucleotides were: 5'-GTTTAAGTCTTGCTTGCGATC 
GCTTGGCTATGACCTGCC-3' (SEQ ID NO:27) and 5'-GGGCAGGT 
CATAGCCAAGCGATCG CAAGCAAGACTTAAA-3' (SEQ ID NO:28) 
and Clal oligonucleotides were: 5'-GCCTGACCCCGAATTTGAATC 
G ATTCTTG AC ACTG TTG -3' (SEQ ID NO:29) and 5'-CAACAGTGT 
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CAAGAATCGATTCAA ATTCGGGGTCAGGC-3' (SEQ ID NO:30). The 
resulting plasmid was called pEA670. 

Third, the Sgfl/Clal mutated spectrin gene was cloned by 
PCR into pEA661 under the following experimental conditions. A 
forward primer 5'-AATGGTGCATGCAAGGAGATGGCGCCCAAC 
AGTC-3' (SEQ ID NO:31) and reverse primer 5'-GCTTTGGCTAG 
CTTTCCTGTGTCACCTGCTGATCATGAACG-3' (SEQ ID NO:32) were 
used as described in the Expand High Fidelity PCR system 
(Boehringer Mannheim, Indianapolis, IN) in the presence of 1X 
buffer 2 (New England Biolabs, Beverly, MA) and 50 ng of 
pEA670 DNA. Amplification was carried out in a Perkin- 
Elmer/Cetus (Emeryville, CA) thermal cycler 480, 94°C, 30 sec; 
45°C, 30 sec; 72°C, 45 sec; for 15 cycles. The PCR products of 
one 50 //I tube and 5 //g of pEA661 were Nhel/Sphl digested 
using 250 U/ml of each enzyme. The digestion was performed at 
37°C for 2 hours. Digested PCR products and plasmid DNA were 
separated by agarose gel electrophoresis and the excised bands 
further purified using QIAEX II beads as described by the 
manufacturer (Qiagen, Studio City, CA). Ligation was carried out 
at 16°C for 1 hour using a 1:5 ratio of vector to insert and 
40,000 U/ml of T4 ligase (New England Biolabs, Inc., Beverly, 
MA). Ligation products were transformed into E. coli strain 
Novablue DE3 (Novagen, Madison, Wl) competent cells. 
Recombinant plasmids were checked by Nhel/Sphl digestion which 
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results in the excision of the cloned insert. One of the resultant 
correct plasmids was named pEA671. 

Fourth, the first E. coli gyrA Pvul site in pEA671 was 
eliminated by site-directed silent mutagenesis using the 
QuickChange kit as described by the manufacturer (Stratagene, 
La Jolla, CA) and oligonucleotides 5-GCGTAAAGCTCGCGACC 
GTGCTCATATCC-3' (SEQ ID NO:33) and 5'-GGATATGAGCACGGTC 
GCGAGCTTTACG C-3' (SEQ ID NO:34), resulting in plasmid 
pEA681. 

Fifth, the 200 bp Acc65l/Hindlll fragment from pEA681 
was transferred to pEA671 replacing the Acc691/Hindlll 
fragment of EA671. Plasmids pEA671 and pEA681 were 

i 

digested in 1X buffer 2 (New England Biolabs, Inc., Beverly, MA) 
using 500 U/ml of Acc65l and 500 U/ml of Hindlll (New England 
Biolabs, Inc., Beverly, MA). The digestion was performed at 37°C 
for 2 hours. Digested plasmids were separated by agarose ge| 
electrophoresis and the excised bands further purified using 
QIAEX II beads as described by the manufacturer (Qiagen, Studio 
City, CA). Ligation was carried out at 16°C for 3 hours using a 
1:3 ratio of vector to insert and 40,000 U/ml otTA ligase (New 
England Biolabs, Inc., Beverly, MA). Ligation products were 
transformed into E. coli strain XL1 B (Stratagene, La Jolla, CA) 
competent cells. Recombinant plasmids were checked by Pvul 
digestion. One of the resultant correct plasmids was named 
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pEA682. This plasmid contains both the a-spectrin peptide 
library and the E. coli gyrA::Mxe gyrA intein-based selection 
system on the same plasmid, both under control of a T7 RNA 
polymerase promoter (Figure 5A). 

A Theoretical Screening for Peptides That Disrupt Protein 
Splicing of the Mxe GyrA intein (IVPS) in P. coli GyrA 

In the theoretical embodiment detailed below, random 
peptides of 7-12 amino acids would be inserted in-frame into the 
loop between the 2 EF-hand motifs of a-spectrin, contained, as 
described above, on the same plasmid as E. coli gyrA::Mxe gyrA 
intein fusion. The resulting plasmids would be electro- 
transformed into strain ER2726 (New England Biolabs, Inc., 
Beverly, MA). Transformants would be selected in LB liquid 
growth media in the presence of ampicillin, ofloxacin (Sigma, St. 
Louis, MO) and IPTG to allow selection against the splicing 
proficient clones. Ampicillin selects for the presence of the 
plasmid and ofloxacin selects for peptides that block splicing, 
since the spliced E. coli GyrA protein would be sensitive to the 
drug and lead to cell death. Plasmid DNA would be isolated from 
selected clones and digested with Sgfl and Clal to isolate DNA 
fragments encoding the selected spectrin peptides. The spectrin 
DNA loop fragments would then be cloned back into the original 
selection plasmid. Iterative rounds of drug selection and "back- 
cloning" would be performed (Figure 5B). Iterative screening 
helps enrich for agents that truly block splicing while eliminating 
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clones that survived selection because of some other mutation 
or anomaly. Final selected clones would be grown individually in 
liquid culture and the plasmid-encoded E. coli GyrA specifically 
induced by IPTG. Crude protein cell extracts would be 
electrophoresed and blotted for immuno-staining. Clones in which 
the E. coli GyrA spliced product was not detected would be 
considered positives, i.e. clones in which splicing had been 
disrupted, potentially by the selected peptide. 

The random peptide library would be synthesized in vitro 
using the following protocol, as was done in Example II. A single 
strand/double strand DNA hybrid cassette would be synthesized 
by annealing 2 oligonucleotides: 5'-TGTCAAGAATC 
GATTCAAATTCGGGGTCAGGCTCTCC((W)NN) 7 . 12 ATAGCCAAGCGA 

T-3' (SEQ ID NO:35) and S'P-CGCTTGGCTAT-S' (SEQ ID NO:36). 5 
fjg of oligonucleotide SEQ ID NO:35 and 3 molar equivalents of 
oligonucleotide SEQ ID NO:36 would be mixed together in the 
presence of 0.1 M NaCI in a final volume of 50 //I. The mixture 
would then be boiled and immediately cool down to room 
temperature in the same boiler. The single stranded random 
nucleotide part of the DNA hybrid cassette formed by annealing 
of the 2 oligos would be extended using 400 //M of each dNTPs 
and 60 U/ml of Klenow DNA polymerase (New England Biolabs, 
Inc., Beverly, MA) in a final volume of 200 /j\ in 1X EcoPol buffer 
(New England Biolabs, Beverly, MA). The extension reaction would 
be left 20 minutes at 37°C and further purified using QIAEX II 
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beads as described by the manufacturer (Qiagen, Studio City, 
CA). 60 jjg of pEA682 (50 //g/ml) would be digested in 1X Buffer 
2 (New England Biolabs, Inc., Beverly, MA) using 250 U/ml of Sgfl 
(Promega, Madison, Wl) and 500 U/ml of BspDI (New England 
Biolabs, Inc., Beverly, MA) (an isoschizomer of Clal) in the 
presence of 100 //g/ml of BSA. The digestion would be 
performed at 37°C for 2 hours. Purified cassettes (50//I) would 
then be digested in 1X Buffer 4 (New England Biolabs, Inc., 
Beverly, MA) using 500 U/ml of Clal (New England Biolabs, Inc., 
Beverly, MA) in the presence of 100 //g/ml of BSA. Cassettes 
would be further purified using QIAEX II beads as described by 
the manufacturer (Qiagen, Studio City, CA). Digested plasmid 
DNA would be electrophoresed on 0.7% agarose gel and the 
excised bands further purified using QIAEX II beads as described 
by the manufacturer (Qiagen, Studio City, CA). Ligation would be 
carried out at 16°C for 1 hour using a 1:1 ratio of vector (2 
ng//y|) to insert and 1 ,600 U/ml of T4 ligase (New England 
Biolabs, Inc., Beverly, MA). Ligation products would then be 
electro-transformed into E. coli strain ER2744 (New England 
Biolabs, Inc., Beverly, MA) competent cells (10 9 pUC18 
transformants/^g) using 1-2 /vg of total ligated plasmid for 
each 200 /j\ aliquot of competent cells, at 2.5 kV/cm in a 2 mm 
cuvette (BIORAD, Richmond, CA). Cells would be allowed to 
recover in a shaker for 1 hour at 37°C. Recovered 
transformants would be inoculated at 1/100 dilution ratio into 
LB liquid growth media containing appropriate amounts of 
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ofloxacin (Sigma, St. Louis, MO), 100 //g/ml of ampicillin and 1 
mM IPTG. Transformants would be incubated overnight at 37°C. 

Plasmid DNA would be isolated from a 100 ml of the overnight 
culture using a tip100 column (QIAGEN, Studio City, CA), 
Clal/Sgfl digested as above and electrophoresed on a 4% GTG 
Nusieve agarose gel (FMC BioProducts, Rockland, ME). The 57 to 
72 bp spectrin loop DNA inserts (depending upon whether the 
peptide library contained 7 or 12 random amino acids) would be 
purified using QIAEX II beads as described by the manufacturer 
(Qiagen, Studio City, CA) and cloned back into Sgfl/Clal digested 
and purified selection plasmid as described above. This protocol 
would be repeated 3 times to enrich the pool of transformants 
for peptide clones having the most biologically active sequences 
against the protein splicing of the Mxe intein (IVPS). Finally 
selected clones would be grown individually in 10 ml LB containing 
100 /yg/ml ampicillin at 37°C and induced with 1 mM IPTG for 3 
hours. Crude protein cell extracts would be electrophoresed on a 
10-20% gradient gel (Novex, San Diego, CA). The gel would then 
be electro-blotted for immuno-staining using anti-His tag 
antibodies (Sigma, St. Louis, MO) to detect GyrA::Mxe intein 
(IVPS) protein splicing products. One would expect to see the 
absence of spliced product. The clones would then be sequenced 
to determine the amino acid sequences which had been selected. 
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Hypothetical Screening with Agents that i nhibit Splicing 

At this stage, the vector, pEA600, is amenable for 
screening with any type of agent that blocks splicing, using a 
similar screening protocol as for peptides that block splicing, 
described above. However, in this case, pEA600 or similar 
plasmids can be directly screened without having to clone the 
peptide library contained within the chicken ct-spectrin gene as 
described above. The protocol would involve treating individual 
cultures with single or pooled agents that can enter the cell and 
looking for cell growth, using any means known to one skilled in 
the art. Agents that block splicing allow the cell to grow in the 
presence of ofloxacin 

Summary 

In summary, we describe the cloning of the Mxe gyrA intein 
gene into the E. coli gyrA extein gene for use in selecting for 
agents that inhibit splicing. The Mxe GyrA intein splices well in 
the E. coli GyrA extein, resulting in production of active E. coli 
GyrA protein. The E. coli GyrA extein was used with the Mxe 
GyrA intein because the Mle GyrA intein did not splice efficiently 
in E. coli in its native context and the precursor was mostly 
insoluble in E. coli. Because the GyrA intein and extein sequences 
are very similar (Telenti, et al., J. bacteriol, 179:6378-6382 
(1997) and Perler, et al., Nucleic Acids Res. 27:346-347 
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(1999)), mixing and matching of inteins, exteins and 
experimental hosts resulted in an efficient model system for 
examining agents that modulate splicing of GyrA inteins, using 
exteins that have similar insertion sites and therefore similar 
splicing active sites as in the native context. 

EXAMPLE II 

A M. TUBERCULOSIS DnaB INTEIN-MEDIATED POSITIVE 

SELECTION SYSTEM 

A Positive Selection System using thft Mt u DnaB intein (IVPS) in 
its Native Mtu DnaB Extein in F nntir Background 

The hexameric E. coli helicase encoded by the dnaB gene 
interacts with an hexameric DnaC complex and ATP. Some DnaB 
mutants are dominant lethal (Bouvier and Oreglia, C.R. Acad. Sci. 
Hebd. Seances Acad. Sci D., 280:649-652 (1975), Maurer and 
Wong, J. Bacteriol 170:3682-3688 (1988), Saluja and Godson, J. 
Bacterid. 177:1104-1111 (1995) and Sclafani, et al. f Mol. Gen. 
Genet., 182:112-118 (1981)). By dominant or dominantly 
cytotoxic, we mean that the toxicity occurs even if homologous 
proteins are present which are not cytotoxic or resistant to the 
drug, i.e., the cytotoxic effect dominates irrespective of the 
presence of non-cytotoxic homologs. The mutant protein is 
deficient in ATP hydrolysis, helicase activity, and replication 
activity at the chromosomal origin of replication resulting in cell 
death (see Figure 4A). Despite only moderate protein sequence 
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identity between bacterial helicases, arginine 231 is located in a 
conserved motif proposed to interact directly with DnaC 
(Marszalek and Kaguni, J. Biol. Chem., 267:19334-19340 (1992) 
and Shrimankar, et al., J. Bacteriol., 174:7689-7696 (1992)). M. 
tuberculosis (Mtu) DnaB has a naturally occurring intein at the 
carboxy-terminus and an arginine at position 227 homologous to 
arginine 231 of E. coli DnaB (see Figure 4D). 

We have demonstrated proficient protein splicing of the 
Mtu DnaB intein (IVPS) from the Mtu DnaB precursor protein in E. 
coli and also have shown that the R227C mutation results in 
dominant lethality. Therefore, a merodiploid cell containing a wild 
type dnaB gene and a Mtu DnaB (R227C) gene is not viable unless 
protein splicing can be disrupted (see Figures 4B and 4C). By 
merodiploid we mean that the cell contains an extra copy of a 
gene (or several genes) which has been introduced into the cell 
by any means known to one skilled in the art, such as 
transformation, infection, conjugation, plasmids, virus, phage, or 
by generating a transgenic strain and which may be present on 
either an episomal element or on the host chromosome. The co- 
expression of a chicken oc-spectrin peptide library (as described 
in U.S. 5,834,247 supra, at Example 17) allows for the positive 
selection of peptides that can disrupt splicing of the M. 
tuberculosis DnaB intein (see Figure 5A). Likewise, this system 
can be used to screen for any agent that inhibits splicing of the 
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Mtu DnaB intein or any other DnaB intein in vivo or for DnaB 
intein mutations that block splicing. 

» 

Constnmtinn of a Positive Selection System using the Mtn DnaR 
Intein (IVPfi ) in its Native Mtu DnaR Fytein 

As described in detail below, the Mtu dnaB gene has been 
cloned by PCR under T7 RNA polymerase transcriptional control. 
In E. coli, the Mtu DnaB intein (IVPS) splices very efficiently from 
its natural precursor to produce the Mtu DnaB helicase. The Mtu 
dnaB gene has been mutagenized at position 227 from arginine 
to cysteine and the plasmid transformed into BL21(DE3)-Gold 
(Stratagene, La Jolla, CA). In the presence of the T7 RNA 
polymerase (induced by IPTG) only splicing deficient clones can 
survive (see Figure 4C). 

First, the Mtu dnaB gene was cloned by PCR using M. 
tuberculosis H37Ra genomic DNA under the following 
experimental conditions. A forward primer 5'-AGGTGAGAA 
TTCATGGCGGTCGTTGATGACC-3' (SEQ ID NO:37) and reverse 
primer S'-TATATAAAGCTTTCATGTCACCGAGCCATGTTGGCG-S' 
(SEQ ID NO:38) were used as described in the Extend Long 
Template PCR system (Boehringer Mannheim, Indianapolis, IN) in 
the presence of 1X buffer 3 and 100 ng of M. tuberculosis 
genomic DNA. Amplification was carried out in a Perkin- 
Elmer/Cetus (Emeryville, CA) thermal cycler 480 for 2 min at 
94°C and then cycled at 45°C, 30 sec; 68°C, 2 min; 95°C, 1 min 
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for 25 cycles. The PCR products of one 50 p\ reaction and 5 
of pET21a (Novagen, Madison, Wl) were digested using 1000 
U/ml of EcoRI and 800 U/ml of Hindlll in 1X EcoRI buffer (New 
England Biolabs, Inc., Beverly, MA). The digestion was performed 
at 37°C for 1 hour. Digested PCR products and plasmid DNA 
were separated by agarose gel electrophoresis and the excised 
bands further purified using QIAEX II beads as described by the 
manufacturer (Qiagen, Studio City, CA). Ligation was carried out 
at 16°C for 1 hour using a 1:5 ratio of vector to insert and 
40,000 U/ml of T4 ligase (New England Biolabs, Inc., Beverly, 
MA). Ligation products were transformed into E. coli strain 
Novablue DE3 (Novagen, Madison, Wl) competent cells. 
Recombinant plasmids were checked by EcoRI/Hindlll digestion 
which results in the excision of the cloned inserts. One of the 
resultant correct plasmids was named pEA807. The sequence of 
the dnaB insert was checked by DNA sequencing. 

Second, the 1200 bp Bglll/SgrAI fragment from pEA682 
containing the spectrin-based peptide library was transferred to 
pEA807. Plasmids pEA682 and pEA807 were digested in 1X 
buffer 2 (New England Biolabs, Inc., Beverly, MA) using 500 U/ml 
of Bglll and 240 U/ml of SgrAI. The digestion was performed at 
37°C for 1 hour. Digested plasmids were separated by agarose 
gel electrophoresis and the excised bands further purified using 
QIAEX II beads as described by the manufacturer (Qiagen, Studio 
City, CA). Ligation was carried out at 16°C for 1 hour using a 1:5 
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ratio of vector to insert and 40,000 U/ml of T4 ligase (New 
England Biolabs, Inc., Beverly, MA). Ligation products were 
transformed into E. coli strain ER2726 (New England Biolabs, 
Inc., Beverly, MA) competent cells. Recombinant plasmids were 
checked by Clal/Ncol digestion. One of the resultant correct 
plasmids was named pEA808. 

Third, the Mtu dnaB gene was mutagenized to R227C by 
PCR under the following experimental conditions. A forward 
primer 5'-AGGTGAGAATTCATGGCGGTCGTTGATGACC-3' (SEQ ID 
NO:39) and reverse primer S'-TTTCCCACGCCCGGGCaCGCCGC 
CACGATGACC-3' (SEQ ID NO:40) were used as described in the 
Extend Long Template PCR system (Boehringer Mannheim, 
Indianapolis, IN) in the presence of 1X buffer 3 (New England 
Biolabs, Inc., Beverly, MA) and 500 ng of pEA808 DNA. 
Amplification was carried out in a Perkin-Elmer/Cetus 
(Emeryville, CA) thermal cycler 480 for 2 min at 94°C and then 
cycled at 45°C, 30 sec; 72 6 C, 45 sec; 95°C, 1 min for 20 cycles. 
The PCR products of one 50 jj\ reaction and 2 //g of pEA808 
were digested overnight at 37°C using 100 U/ml of EcoRI and 40 

U/ml of Srfl in the 1X PCR-Script reaction buffer (Stratagene, 
La Jolla, CA). Digested PCR products and plasmid DNA were 
separated by agarose gel electrophoresis and the excised bands 
further purified using QIAEX II beads as described by the 
manufacturer (Qiagen, Studio City, CA). Ligation was carried out 
at 16°C for 1 hour using a 1:1 ratio of vector to insert and 
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40,000 U/ml of T4 ligase (New England Biolabs, Inc., Beverly, 
MA). Ligation products were transformed into E. coli strain 
XL10-Kan (Stratagene, La Jolla, CA) competent cells. 
Recombinant plasmids were checked by Ndel/Ascl. One of the 
resultant correct plasmids was named pEA809. 

Fourth, the Notl dnaB-spectrin module was inverted on 
plasmid pEA809. 2 pg of pEA809 DNA was digested with 500 
U/ml of Notl at 37°C for 2 hours and digestion products split 
into two tubes. One tube containing 1 pg of Notl digested 
pEA809 was incubated further with 100 U/ml Calf Intestinal 
Alkaline Phosphatase (CIP, New England Biolabs, Inc., Beverly, 
MA) for 20 minutes at 37°C. Digested plasmid DNA from both 
tubes was separated by agarose gel electrophoresis and the 
excised bands further purified using QIAEX II beads as described 
by the manufacturer (Qiagen, Studio City, CA). Ligation of the 
vector band from the CIP treated tube and the insert band from 
the CIP untreated tube was carried out at 16°C for 1 hour using 
a 1:1 ratio of vector to insert and 40,000 U/ml of T4 ligase 
(New England Biolabs, Inc., Beverly, MA). Ligation products were 
transformed into E. coli strain XL1-Blue (Stratagene, La Jolla, 
CA) competent cells. Recombinant plasmids were checked by 
Sacll digest. One of the resultant correct plasmids was named 
pEA810. 
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Fifth, the lacN gene from pEA810 was removed and 
replaced by a smaller DNA fragment from pBR322. pEA810 and 
pBR322 DNA were digested using 500 U/ml EcoRV and 500 U/ml 
Hindlll in buffer 2 (New England Biolabs, Inc., Beverly, MA) at 37°C 
for 2 hours. Digested plasmid DNA was separated by agarose gel 
electrophoresis and the excised bands further purified using 
QIAEX II beads as described by the manufacturer (Qiagen, Studio 
City, CA). Ligation of the pEA810 vector band and the pBR322 
insert band was carried out at 16°C for 1 hour using a 1:1 ratio 
of vector to insert and 40,000 U/ml of T4 ligase (New England 
Biolabs, Inc., Beverly, MA). Ligation products were transformed 
into E. coli strain XL1-Blue (Stratagene, La Jolla, CA) competent 
cells. One of the resultant correct plasmids was named pEA813. 

Sixth, the first Mtu R227C dnaB Aatll site of pEA813 was 
eliminated by site-directed silent mutagenesis using the 
QuickChange kit (Stratagene, La Jolla, CA) and oligonucleotides 
y-GCCGCCGATCCGCGACATCGTAGATTTCGGCC -3' (SEQ ID NO:41) 
and reverse primer 5'-GGCCGAAATCTACGA 
TGTCGCGGATCGGCGGC-3' (SEQ ID NO:42) resulting in plasmid 
pEA832. 

Seventh, the wild type intein containing Mtu DnaB gene of 
plasmid pEA808 was shuffled back into pEA832. pEA808 and 
pEA813 DNA were digested using 1x buffer 1 (New England 
Biolabs, Inc., Beverly, MA) with 500 U/ml EcoRI and 500 U/ml 
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Hindlll at 37°C for 1 hour. Digested plasmid DNAs were separated 
by agarose gel electrophoresis and the excised bands further 
purified using QIAEX II beads as described by the manufacturer 
(Qiagen, Studio City, CA). Ligation of the pEA813 vector band 
and the pEA808 insert band was carried out at 16°C for 1 hour 
using a 1:1 ratio of vector to insert and 40,000 U/ml of T4 
ligase (New England Biolabs, Inc., Beverly, MA). Ligation products 
were transformed into E. coli strain XL1-Blue (Stratagene, La 
Jolla, CA) competent cells. One of the resultant correct plasmids 
was named pEA825. 

Eighth, the Aatll site elimination in pEA825 was performed 
identically as described for pEA813, resulting in plasmid pEA835. 

Screening for Peptides That Disrupt th* Mtn nnaR intain ( |y ps) 

Protain Splining 

The following is an actual experimental example 
demonstrating the use of this system to select for peptides 
that block splicing. As detailed below, random peptides of 7-12 
amino-acids were inserted in-frame into the loop of the 2 EF- 
hand motif of a-spectrin, contained, as described above, on the 
same plasmid as the Mtu DnaB intein (IVPS) selection system. 
The resulting plasmids were electro-transformed into the T7 
RNA polymerase E. coli strain ER2744 (New England Biolabs, Inc., 
Beverly, MA) (see Figures 5A and 5B). Transformants were 
selected in LB liquid growth media in the presence of ampicillin 
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and IPTG to allow selection against the splicing proficient clones. 
Plasmid DNA was isolated from selected clones and digested to 
isolate DN A fragments encoding the selected spectrin peptides. 
The selected spectrin loop region DNA was cloned back into the 
original selection plasmid. Iterative rounds of selection and 
"back-cloning" were performed (Figure 5B). After selection, the 
selected spectrin peptide were cloned into pEA825 (containing 
the non-toxic DnaB gene) for expression analysis. Final selected 
clones were grown individually in liquid culture and the plasmid- 
encoded Mtu dnaB gene specifically induced by IPTG. Crude 
protein cell extracts were electrophoresed and blotted for 
immuno-staining. Clones in which the Mtu DnaB spliced product 
was not detected were considered positives, i.e. clones in which 
splicing had been disrupted, potentially by a selected peptide. 

The random peptide library was synthesized in vitro using 
the following protocol. A single strand/doubte strand DNA hybrid 
cassette was synthesized by annealing of 2 oligonucleotides : 5'- 
TGTCAAGAATCGATTCAAATTCGGGGTC AGGCTCTCC((W)NN) 7 _i 2 
ATAGCGAAGCGATCGCAGGCAGCTTTT 
AAAGCCCTGATGGTTCAGACGT-3' (SEQ ID NO:43) and 5'P- 
CTGAACCATCAGGGC-3' (SEQ ID NO:44). 5//g of oligonucleotide 
SEQ ID NO:43 and 3 molar equivalents of oligonucleotide SEQ ID 
NO:44 were mixed together in the presence of 0.1 M NaCI in a 
final volume of 50 The mixture was boiled and immediately 
cooled down to room temperature in the same boiler. The single 
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strand random nucleotide part of the DNA hybrid cassette 
formed by annealing of the 2 oligos was extended using 400 //M 
of each dNTPs and 60 U/ml of Klenow DNA polymerase (New 
England Biolabs, Inc., Beverly, MA) in a final volume of 200 //I in 
5 1X EcoPol buffer (New England Biolabs, Inc., Beverly, MA). The 

extension reaction was incubated 20 minutes at 37°C and 
further purified using QIAEX II beads as described by the 
manufacturer (Qiagen, Study City, CA). 60 //g of pEA832 (50 
//g/ml) were digested in 1X Buffer 4 (New England Biolabs, Inc., 

10 Beverly, MA) using 400 U/ml of Aatll (New England Biolabs, Inc., 

Beverly, MA) and 500 U/ml of Clal (New England Biolabs, Inc., 
Beverly, MA) in the presence of 100 //g/ml of BSA. The digestion 
was performed at 37°C for 2 hours. Synthesized random 
cassettes (20 //g/ml) were digested in 1X Buffer 4 (New England 

15 Biolabs, Inc., Beverly, MA) using 500 U/ml of Clal (New England 

Biolabs, Inc., Beverly, MA) in the presence of 100 //g/ml of BSA. 
Cassettes were further purified using QIAEX II beads as 
described by the manufacturer (Qiagen, Studio City, CA). 
Digested plasmid DNA was electrophoresed on a 0.7% agarose 

20 gel and the excised bands further purified using QIAEX II beads as 

described by the manufacturer (Qiagen, Studio City, CA). 
Ligation was carried out at 16°C for 1 hour using a 1:1 ratio of 
vector (2 ng///l) to insert and 1,600 U/ml of T4 ligase (New 
England Biolabs, Inc., Beverly, MA). Ligation products were 

25 electro-transformed into E. coli strain ER2744 (New England 

Biolabs, Inc., Beverly, MA) competent cells (competency of 



WO 01/32831 



PCTYUS00/29596 



- 56 - 

1x10 9 pEA835 transformants//yg) using 2 jjg of total ligation 
product for each 200 //I aliquot of competent cells, at 2.5 
kV/cm in a 2 mm cuvette (BIORAD, Richmond, CA). Cells were 
allowed to recover in a shaker for 1 hour at 37°C. Recovered 
transformants were inoculated at 1/100 dilution into LB liquid 
growth media containing 100 /yg/ml of ampicillin and 1 mM IPTG. 
Transformants were incubated overnight at 30°C. Plasmid DNA 
was isolated from the overnight culture using tiplOO columns 
(QIAGEN, Studio City, CA)), Aatll/Clal digested as above and 
electrophoresed on a 4% GTG Nusieve agarose gel (FMC 
BioProducts, Rockland, ME). The 57 to 72 bp spectrin loop DNA 
inserts were purified using QIAEX II beads as described by the 
manufacturer (Qiagen, Studio City, CA) and cloned back into 
Aatll/Clal digested and purified selection plasmid (pEA832) as 
described above. This protocol was repeated 3 times to enrich 
the pool of transformants for peptide clones having the most 
biologically active sequences against the protein splicing of the 
Mtu DnaB intein (IVPS). Finally selected spectrin modules were 
cloned into a pEA832 homologous plasmid containing the wild 
type Mtu dnaB gene (pEA825) and grown individually in 10 ml LB 
containing 100 jug/ml ampicillin at 37°C and induced with 1 mM 

IPTG for 3 hours. Crude protein cell extracts were 
electrophoresed on a 10-20% gradient gel (Novex, San Diego, 
CA). The gel was further electro-blotted for immuno-staining 
using anti-T7 tag antibodies (Novagen, Madison, Wl) to detect 
Mtu DnaB protein splicing products (Figure 5C). Lane Eco DnaB 
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contains extracts of T7-tagged E. coli DnaB without the intein. 
pMtuDnaB contains extracts from a clone expressing only Mtu 
DnaB. Lanes p814, p815, p816, p817, and p818 contain 
extracts of the clones pEA814, pEA815, pEA816, pEA817, and 
pEA818, respectively, encoding peptides selected for inhibition 
of splicing. To demonstrate that the inhibition of splicing was 
due to the peptide inserted into the chicken cc-spectrin loop, the 
selected sequence of pEA818 was replaced with the spectrin 
sequence, DLPMVEE (SEQ ID NO:10) to generate clone 
pEA818rev, and extracts loaded on lane p818rev. Splicing of 
pEA818rev occurred as efficiently as with the pMtu DnaB clone 
that expresses the wild-type spectrin protein. Note that in the 
absence of splicing, much of the DnaB precursor undergoes 
cleavage at the intein C-terminal splice junction. 

The sequence of the inserted peptides in these clones is 
as follows: 



pEA814 
PEA815 
pEA816 
pEA817 
pEA818 



TVQSTKR (SEQ ID NO:5) 

RPAPRPL (SEQ ID NO:6) 

PTARTYE (SEQ ID NO:7) 

PTRPTAPPLNFS (SEQ ID NO:8) 

HPNPHPTLSGQR (SEQ ID NO:9) 



pEA818rev 



DLPMVEE (SEQ ID NO: 10) 



We have thus demonstrated that this system can be used 
to select for peptides that block splicing of the Mtu DnaB intein. 
This system is amenable to selection of any modulators of 
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splicing of the Mtu DnaB intein or other DnaB inteins, as long as 
the agent can enter a cell. 

EXAMPLE III 

IN VIVO CONTROL OF PROTEIN SPLICING FOR 
chemotherapeutic PURPOSES OR TO MAKE 
CONTROLLABLE GENE KNOCKOUTS 

The selection and screening systems described for 
selection of agents that modulate protein splicing can also be 

* 

applied to intein-less versions of the extein gene to select for 
agents that inhibit or activate the extein gene product. All of 
the selection and screening systems described in this patent are 
based on the activity or inactivity of the extein portion of the 
precursor. If one deletes the intein from the intein-containing 
gene by methods known to one skilled in the art, then one can 
select for agents that block or activate extein activity also 
using the methods described for inhibiting or activating splicing 
of the intein containing precursor, since these latter methods 
involve assaying extein function. For example, if one deletes the 
intein from the Mtu DnaB gene by methods known to one skilled 
in the art, then one can select for agents that block activity of 
the cytotoxic Mtu DnaB protein using the methods described for 

» 

inhibiting splicing of the DnaB intein. M. tuberculosis can then be 
attacked using a cocktail of two agents that block activity of 
the essential DnaB protein, making it more difficult for the 
organism to develop resistance to these agents. 
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We have previously described the insertion of a CIVPS or 
I VPS into a foreign gene. In these cases, protein splicing could be 
controlled by temperature, mutation, pH, photo-activated 
blocking groups, phosphorylation or peptides" (Comb, et al., U.S. 
Patent No. 5,834,247 and Comb, et al., U.S. Patent No. 
5,496,714). In this Example we describe a general method for 
selecting specific protein splicing inhibiting or activating agents 
that are capable of controlling protein splicing in vivo or in vitro. 
The methods are equally applicable to genetic selection systems 
or reporter systems. By genetic selection, we mean, in this 
Example, that viability or growth rate of the test organism is 
monitored during the experiment, while a reporter system in this 
Example refers to the monitoring of a marker, such as color 
detection, fluorescence, phenotype, etc., rather than cell 
viability. Genetic selection or reporter systems are used to 
identify agents that can either disrupt or catalyze protein 
splicing of a given intein, depending on the context of the 
experiment. Any genetic selection or reporter system known to 
one skilled in the art can be used to isolate agents which disrupt 
or catalyze protein splicing. This strategy is equally applicable to 
any intein present in a foreign context or in its native or 
homologous context (e.g., the insertion of an intein at the same 
position in an homologous extein). However, use of the native 
extein is preferable because it best represents the enzyme 
target of the intein. If the native precursor does not express 
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well or splice well in the experimental host organism, then the 
intein can be inserted into the same site in that host organism's 
homolog of the native extein or in another extein homolog with 
desired properties for testing, using any method known to one 
skilled in the art or described in the previous Examples. This 
method of finding agents that modulate splicing is applicable to 
any host, as long as the protein splicing precursor is operably 
linked to the appropriate control signals for transcription and 
translation in that host. As the target organism may not be an 
easy experimental model for identifying agents that modulate 
protein splicing, the agent may first be identified in a model 
system and then tested in the final target organism. This 
strategy is summarized in Figure 8. 

Experiments involving inhibition of splicing start with a 
precursor that contains a fully active intein that may or may not 
be controllable. The goal of this experiment is to find agents 
that can be used to control splicing of this intein. In experiments 
involving activation of splicing, a CIVPS (controllable intein) or an 
inactive intein is required, as the goal is to find agents that 
activate the previously inactive intein. The intein may be 
inactivated by any means known to one skilled in the art, such as 
temperature sensitive inteins, inteins with mutations in amino 
acids known to be involved in catalysis that slow down or block 
splicing (including the conserved amino acids at both splice 
junctions and in intein Block B, (Perler, Nucleic Acids Res. 
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25^087-1093 (1997), Pietrokovski, Protein Sci., 3:2340-2350 
(1994)) inteins which have been randomly mutated and selected 
for inhibition or blockage of splicing. 

A positive selection system is preferred. In general, a 
positive selection system consists of a gene that is detrimental 
to a host organism depending on the growth media or the host 
strain genetic background. The gene product is static or lethal 
for the cell, kilting the host or preventing growth unless the gene 
product is inactivated. The gene product may be directly 
cytotoxic to the host in a dominant manner, as in the PnaB 
example (Example II) or it may be dominantly cytotoxic in 
response to a drug which the chromosomal copy of the gene is 
resistant to, as in the GyrA example (Example I). By dominant or 
dominantly cytotoxic, we mean that the toxicity occurs even if 
homologous proteins are present which are not cytotoxic or 
resistant to the drug, i.e., the cytotoxic effect dominates 
irrespective of the presence of non-cytotoxic homologs. In the 
context of a protein splicing inhibition system, positive selection 
involves a system that allows selection against the splicing of an 
IVPS or intein. If splicing occurs, the cytotoxic extein protein 
will be active and kill the cell or inhibit growth; if splicing is 
disrupted, the cytotoxic extein protein will be inactive and cells 
will grow. Cell growth can be monitored by any means known to 
one skilled in the art, including, but not limited to obervation of a 
colony on solid media, optical density, monitoring of fluorescent 
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reporters of cell growth such as green fluorescent protein of 
luciferase activity. The extein gene may be an unrelated 
reporter system or the natural extein of the intein (either using 
the natural precursor or inserting the intein into a homologous 
extein context) (Figure 8). In this context, selection systems 
have the advantage that only agents that inhibit splicing allow 
cell growth and are thus easily found amongst the background of 
agents that have no effect on splicing or are directly toxic to 
the cell. If the agents to be tested are also expressed in the 
host cell, then one examines the colonies that survive on the 
plate. If the agent to be tested is not expressed in the host cell, 
but is instead added to the media, then aliquots of host cells 
must be arrayed for testing with individual agents or pools of 
agents in any number of devices, such as microtiter dishes. In 
such cases, cell growth may be more easily measured if the cells 
expres a protein that leads to fluorescence, such as green 
fluorescent protein or luciferase. 

When selecting for agents that activate splicing, the intein 
is already present or is inserted into a gene whose protein 
product is required for cell growth. In the absence of splicing, 
the cell fails to grow or dies. In order to practically employ this 
selection system, a second gene is present which can rescue the 
cell in the absence of splicing. This second copy of the gene 
should be controllable, by methods such as a temperature 
sensitivity or controllable promoters, to allow cell growth until 
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the agent which activates splicing is applied or induced in the 
cell. The cells are treated with the splicing activator and then 
moved to the nonpermissive condition for activity of the second 
gene product that does not contain the intein or expression of 
this gene is turned off. Cell growth will then require splicing 
since the second gene product lacking the intein is no longer 
active. 

Another method for identifying agents that modify protein 
splicing involves screening rather than genetic selection. 
Screening systems employ reporter genes whose products can 
be readily assayed, but do not necessarily affect cell growth. 
Many reporter systems are known, such as the blue/white 13- 
galactosidase screening system. B-galactosidase acts on X-gal, 
for example, to generate a blue color; in the absence of 13- 
galactosidase activity, the X-gal remains uncolored or 'white'. 
Other reporters include those described in Bums and Beacham, 
Gene, 27:323-325 (1984) and Mechulam, et al., J. Bacterid., 
163:787-791 (1985). One can use native precursors if 
reporter systems are available for those extein genes or the 
intein can be cloned into the reporter gene (B-galactosidase in 
this Example) (see for example, Belfort, U.S. Patent No. 
5,795,731 and Comb, et al., U.S. Patent No. 5,834,247). Agents 
that inhibit splicing of an otherwise active intein will block 
reporter protein functions, such as B-galactosidase action on X- 
gal, resulting in white instead of blue clones. Agents that 
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activate splicing of an otherwise inactive or slowly acting intein 
restore reporter protein functions, resulting in blue clones using 
the G-galactosidase system as an example. If the agents to be 
tested are also expressed in the host cell, then one examines 
the colonies that survive on the plate. If the agent to be tested 
is not expressed in the host cell, but is instead added to the 
media, then aliquots of host cells must be arrayed for testing 
with individual agents or pools of agents in any number of 
devices, such as microtiter dishes. Unlike selection, all cells grow 
in reporter systems and one must determine whether the read 
out is positive or negative for each colony or microculture. 

Previous Examples have described genetic selection 
systems based on the pheS non-homologous selection system 
and the gyrA and dnaB intein/extein systems. This Example 
describes how one would screen for agents that modulate 
splicing using any selection or reporter system. Note that the 
selection or screening systems may not have been originally 
identified in the organism containing the intein. However, if a 
selection or screening system has been described for the extein 
homolog, it can be adapted to the intein containing homolog. As 
in the case of DnaB, the same mutation can be made in the intein 
containing homolog to generate a selectable phenotype for the 
intein containing extein gene. As in the case of GyrA, the 
screening system can involve a chromosomal mutation that 
leaves the host resistant to a drug; all that need be done is to 
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show that the intein containing homolog is also sensitive to the 
drug. 

Iterative screening (Figure 5B) provides a method of 
identifying lead compounds and reducing background and can be 
used in any of the schemes described below. Iterative screening 
involves repeated cycles of testing of the agent on fresh extein 
genes. It helps insure that the agent is not acting on a mutated 
extein, which could also be a by-product of screening. 

P o sitive Selection Systems for Inhibition Or Activation Of 
Protein Splicing Of An Intein In Its Natura l Precursor Or An 

Extein Homolog 

In this case, the intein of interest is naturally found in a 
target gene which can naturally serve as a selectable marker or 
reporter or which can be converted into a selectable marker or 
reporter. Initial experiments may be performed in the target 
organism or an experimentally more amenable model host such 
as bacteria, E. coli, yeast, mammalian cells, insect cells, etc. The 
decision as to whether to use the natural splicing precursor to 
select for agents that block splicing or to first insert the intein 
gene into a homologous extein gene from a model organism 
depends on the similarity amongst the extein genes, the ability 
of the natural precursor or recombinant precursors to express 
in the model hosts used for selection or screening, and the 
ability of each precursor to splice in the model hosts. (Figure 8) 
These parameters will have to be experimentally determined, 
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although the more similar the extein sequences, the more likely 
that splicing will work in the homologous extein protein from the 
model organism. Sequence comparison will indicate the 
appropriate homologous intein insertion site in the homologous 
extein gene from the model organism. 

Next, one has to determine by a literature search whether 
any genetic selection systems or screens are available for the 
target extein in any organism and whether the extein gene is 
essential for cell growth in any organism. If the target gene is 
essential, but no genetic selection or screens are available, it 
can be mutagenize directly or in model systems to attempt to 
generate a selection or reporter system. If the target gene 
product is essential to the cell, under defined conditions, the 
host gene can be either knocked out and replaced by a 
controllable copy of the gene or mutated to generate a 
temperature sensitive activity. The intein containing gene must 
the produce an active product when the host gene homolog is 
inactivated. A temperature sensitive phenotype can easily be 
generated by random or rational mutation by one skilled in the 
art. Once a selection system has been identified and the best 
splicing precursor has been determined (selecting from the 
naturally occurring precursor, or after inserting the intein into 
the homologous extein from the target or selection organism), 
testing for agents that block splicing can begin in either a model 
organism or the target organism, depending on ease of use. 
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Some of the possible schemes for identifying agents that block 
or activate splicing are shown in Figure 9. 

Scheme 1 is a method for selecting for agents that inhibit 
splicing. The selection system involves a dominant cytotoxic 
phenotype in response to a drug. By dominant cytotoxic we mean 
that the spliced product is toxic to the cell irrespective of 
expression of a resistant copy of the extein gene. The GyrA 
system described in Example I is an example of this type of 
scheme. The selection host organism contains a chromosomal 
copy of the extein gene that is resistant to the drug and allows 
growth of the organism in the presence of the drug. First, a 
merodiploid is made containing a gene which is sensitive to the 
drug and contains the intein, and a gene which is resistant to the 
drug and does not contain an intein. Second, the host containing 
the resistant extein gene and the intein containing sensitive 
extein gene is then treated with agents that can enter the cell 
or by induction of expression of agents within the cell. Finally, ... 
the selection drug is added to the cells. If the intein splices, the 
drug sensitive target protein kills the cell or inhibits growth 
when the drug is present. If any agent blocks splicing, no drug 
sensitive extein protein is made and the organism grows. 
Usually, one tests a library of compounds of any type, rather 
than a single agent, and one uses small cultures, as in microtiter 
dishes, for example. Any type of agent can be used, as long as it 
can enter the cell. Alternatively, the agent can be cloned and 
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expressed in the target cell and clones can be tested for growth 
on plates or in liquid media. Expression of combinatorial peptide 
libraries would be an example of such an agent that is expressed 
in the cell. 



Scheme 2 is a second method for selecting for agents 
that inhibit splicing. The selection system involves a dominant 
lethal phenotype in the absence of exogenous drug treatment 
that is inherent in the intein containing extein protein or can be 
introduced into the extein protein. The DnaB system described in 
Example II exemplifies this type of system. The selection host 
organism contains a wild type gene that is not toxic to the cell 
and allows growth of the organism. First, a mero-diplid is made 
containing a gene which is toxic to the cell, but contains an intein 
and an intein-less extein gene which is not toxic. Next, this host 

■ 

is treated with agents that can enter the cell before the 
cytotoxic precursor gene is expressed. Finally, expression of 
the intein containing cytotoxic extein gene is induced. If the 
intein splices, the cytotoxic target protein kills the cell or 
inhibits growth. If any agent blocks splicing, no cytotoxic target 
protein is made and the organism grows. Usually, one tests a 
library of compounds of any type, rather than a single agent, 
and one uses small cultures, as in microtiter dishes, for 
example. Any type of agent can be used, as long as it can enter 
the cell. Alternatively, the agent can be cloned and expressed in 
the target cell and clones can be tested for growth on plates or 
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in liquid media. Expression of combinatorial peptide libraries 
would be an example of such an agent that is expressed in the 
cell. 

Scheme 3 selects agents that inhibit splicing of an 
essential gene. In this case, the chromosomal copy of the gene, 
or its equivalent, is either temperature sensitive, sensitive to a 
drug in a recessive manor, or under some type of expression 
control. Alternatively, the chromosomal copy of the extein gene 
is inactivated or knocked out. The cells can grow under 
conditions where the gene product is not needed. The cells are 
then shifted to conditions which require the extein protein for 
survival. An example of this type of extein is a metabolic 
enzyme. When cells are grown in rich media, they can grow. 
However , when cells are grown in minimal media or media lacking 
the downstream product of the extein blocked metabolic 
pathway, the cells fail to grown. The intein containing target 
gene is not temperature sensitive or is resistant to the drug. If 
splicing occurs under non-permissive conditions for the 
chromosomal extein homolog, then the cells live. This system 
requires assay of cell growth in isolated containers, such as 
microtiter dish wells, for example. If the agent blocks splicing, 
then the cells will not grow under non-permissive conditions for 
activity of the intein-less copy of the extein protein. Cell growth 
can be determined by any means known to one skilled in the art, 
including, but not limited to measuring optical density or 
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presence of a fluorophore generated in the cell. First an 
experimental host must be found that contains a controllable 
copy of the extein gene or its equivalent. It is propagated under 
permissive conditions for expression of active intein-less extein 
protein. Second, this host is transformed with a vector 
containing a wild type extein gene or extein homolog gene 
containing the intein. Third, merodiploid cells containing the 
intein-plus and intein-minus copies of the extein gene, or its 
equivalent, are treated with agents to block splicing and are also 
shifted to non-permissive conditions for activity of the intein- 
less extein protein. This may involve a shift to a temperature at 
which the intein-minus protein is inactive, removal of inducers 
for expression of the intein-minus shifting to different media, or 
addition of a drug which inactivates the intein-minus protein. If 
splicing occurs, the cells will continue to grow using the intein- 
plus gene product. However, if the agent inhibits splicing, 
products of both copies of the gene are inactivated and the cells 
die. Alternatively, the agent can be cloned and expressed in the 
target cell. However, in this case, each clone must be copied or 
replica plated to maintain a living copy of the library and a copy 
to be tested for inhibition of splicing. Expression of 
combinatorial peptide libraries would be an example of such an 
agent that is expressed in the cell. 

Schemes 4-6 are methods of selecting for agents that 
activate splicing rather than inhibit it. The precursor contains an 
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inactive intein which is introduced into the cell on any type of 
vector. The agent(s) may be added individually or in pools to 
isolated cultures. Alternatively, the agent can be cloned and 
expressed in the target cell. However, in this latter case, each 
clone must be copied or replica plated to maintain a living copy 
of the library and a copy to be tested for activation of splicing. 
Expression of combinatorial peptide libraries would be an 
example of such an agent that is expressed in the cell. 

Scheme 4 is identical to scheme 1. In the presence of the 
drug, an agent that activates splicing kills the host since the 
intein-plus drug sensitive copy of the gene is active and 
dominantly cytotoxic. One assays for the absence of growth in 
isolated cultures, such as microtiter dish wells, for example. 

Scheme 5 is similar to scheme 1. An agent that activates 
splicing kills the host since the dominantly cytotoxic extein is 
active after splicing of the intein, irrespective of the presence 
of the wild type extein protein derived from the intein-minus 
gene. One assays for the absence of growth in isolated cultures, 
such as microtiter dish wells, for example. 

Scheme 6 is similar to scheme 3, except that the selection 
system requires expression of the spliced target gene for cell 
growth and selects for agents that activate splicing. In this type 
of system, the intein-minus copy of the target extein gene, or 
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its equivalent, is either temperature sensitive, sensitive to a 
drug in a recessive manor, or under some type of expression 
control. Alternatively, the chromosomal copy of the extein gene 
is inactivated or knocked out. The cells can grow under 
conditions where the gene product is not needed. The cells are 
then shifted to conditions which require the extein protein for 
survival. An example of this type of extein is a metabolic 
enzyme. When cells are grown in rich media, they can grown. 
However, when cells are grown in minimal media or media lacking 
the downstream product of the extein blocked metabolic 
pathway, the cells fail to grow. The intein containing target gene 
is not temperature sensitive or is resistant to the drug. The 
target gene containing the intein is introduced into the cell by 
any means known to one skilled in the art. In this case, the intein 
has been modified so that it can not splice under the assay 
conditions. The host copy of the gene is expressed in an active 
form under permissive conditions (permissive temperature, in 
the absence of drug, rich media under permissive expression 
conditions, etc.), allowing the cells to grow. The intein-plus copy 
of the target extein gene, containing the inactive intein, is 
introduced into the cell. After expression of the intein precursor 
is established, agents are added externally or peptide libraries 
are expressed internally to induce splicing. After allowing the 
agent to activate splicing, the cells are shifted to the 
nonpermissive condition (non-permissive temperature, in the 
presence of drug, minimal media under non-permissive 
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expression conditions, etc.). The only cells that can grow are 
those in which splicing activity has been restored by the agent. 
If an external agent is to be tested, then the agent is added to 
cells in isolated containers, such as microtiter dish wells. 
Alternatively, the agent can be cloned and expressed in the 
target cell. In this case, the library of agents can be directly 
tested for cell viability on plates. Expression of combinatorial 
peptide libraries would be an example of such an agent that is 
expressed in the cell. 

Reporter Systems For Inhibition Or Artivatin n Of Protein Splining 
Qf An Intftin In Its Natural Precursor Or An Fxtftin Hnmnlrtg 

Any extein that can be converted into a tractable 
phenotype can be used in a reporter system screen. This type of 
system requires the ability to differentiate between active and 
inactive extein by any direct or indirect means. Once the 
reporter system is available, the intein containing gene is 
introduced into the cell by any method known to one skilled in the 
art and agents that inhibit splicing are added or induced as 
above. Alternatively, an inactive intein is introduced into a cell 
and agents that activate it are added or induced as above. One 
then examines individual clones and determines whether the 
extein is active or not. 
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Systems F or Inhibition Or Activation Of Protein Splicing Of An 
Intein In A n Unrelated Fxtein Context 

The scenario for this method of identifying agents that 
inhibit or activate splicing is the same as schemes described 
above, except that the intein is placed in an unrelated extein. 
However, one must first determine that the intein splices in the 
non-homologous extein (Figure 8). To improve the probability 
that an intein will splice in a non-homologous foreign context, the 
intein insertion site should be as similar to the natural extein 
sequence as possible for at least 1 and up to 5 or more extein 
residues. If the intein is inserted into a nonessential region of 
the target protein, one could possibly modify the sequence of 
the target protein at the intein insertion site to be the same as 
the native extein sequence of that intein. The intein must be 
cloned prior to a Ser, Thr or Cys with the amino acid naturally 
following the intein being the best choice or the Ser, Thr or Cys 
codon must be inserted into the extein along with the intein 
sequence. To improve folding, surface locations on the protein 
would be preferable since they are more likely to allow the extein 
to fold independently of the intein. If the structure of the target 
protein is unknown, protease sensitive sites on the target 
protein should be good positions to insert the intein. 

Since splicing can be sequence dependent, it is optimal to 
experimentally identify agents that modify splicing in the same 
target protein that one wants to finally control. However, 
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agents could possibly also control splicing of that intein in any 
extein. New exteins may have to be treated experimentally. 

Controllable KnnrknnlQ 

Once an agent has been found which can inhibit or activate 
splicing, the homologous extein gene in the target organism is 
replaced by the homologous gene containing the intein by 
methods known to one skilled in the art. For example, this may 
be performed in a one step process by inserting the intein- 
containing gene directly into the chromosomal copy of the extein 
gene by homologous recombination. Alternatively, the intein 
containing gene is introduced into the organism and the non- 
intein containing homolog is inactivated either concurrently or 
separately and in any order of event. Once the only copy of the 
active extein gene contains a intein, gene function can be 
inhibited if the organism is treated with an agent that blocks 
splicing. On the other hand, if a splicing impaired intein is used, 
gene function can be activated if the organism is treated with an 
agent that activates splicing. The agents and splicing can be 
modulated at any time during the development and life of the 
organism by addition or removal of the splicing activating or 
inhibiting agent. For example, a gene for mouse embryogenesis 
can be replaced by an intein containing gene homolog and the 
product of that gene can be activated or inactivated at various 
times to determine when the gene product is required and if it is 
required during multiple stages of development or growth. In a 
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second example, a gene product thought to be required tor 
passage through a specific stage of the cell cycle could be 
replaced with an intein containing copy that would allow study of 
exactly when the gene product is required or to synchronize the 
culture by arresting all cells at the same point in the cell cycle 
to study the effect of any agent, etc., on a synchronized culture 
of cells. 

Use Of Controllable Inteins (OIVPS) In Therapeutics 

Several options can be envisioned for the use of 
controllable splicing to deliver active proteins at specific times 
or to specific places. In many instances, therapeutic drugs can 
be cytotoxic to the host and would be best if only active at the 
target site. For example, chemotherapy drugs are often 
generally cytotoxic and adverse reactions in normal cells could 
be eliminated if the drug could be specifically activated in the 
tumor. If one has a drug that is at least partially proteinacious, 
an intein that can be activated or inhibited by a second agent, as 
described above, could be inserted into the protein portion of 
the therapeutic agent. The drug is then administered 
systemically in an inactive form. The drug could then be 
specifically activated in the tumor or target organ by (1) 
injecting the activating agent into the tumor, (2) exposing the 
tumor to laser treatment to increase the temperature of the 
tumor and thus induce splicing of a temperature sensitive intein, 
(3) use gene therapy to target the inactive cytotoxic precursor 
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to the tumor cells and then add the splicing activator 
systemically or (4) use gene therapy to target the activating 
peptide to the tumor and add the inactive intein containing drug 
systemically. In the Examples described above, the inactive 
cytotoxic precursor or the activating peptide, respectively, 
could be transformed systemically with a vector that is not 
tissue or cell specific, and only expressed in specific target cells 
by operably linking these genes to tissue specific promoters. 

EXAMPI F IV 



METHODS FOR GENERATING TEMPERATURE 

CONTROLLABLE INTEINS 

The methods used for identifying agents that inhibit or 
activate splicing can also be used to identify inteins that are 
active at one temperature and inactive at a second temperature 
(referred to as temperature sensitive inteins). Instead of adding 
an external agent or expressing an internal agent, the intein is 
randomly mutated by any method known to one skilled in the art, 
such as error prone polymerase chain reaction (Figure 10) or 
use of combinatorial DNA sequences at specific regions in the 
intein. Alternatively, one can specifically mutate residues 
thought to function in or assist the chemical reactions, such as 
the C-terminal splice junction residues, the intein N-terminus, 
the intein penultimate residue, the residues in intein Block B 
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(Perler, Nucleic Acids Res., 25:1087-1093 (1997); Perler, 
Nucleate Acids Res., 27:346-247 (1999); Pietrokovski, supra), 
residues proximal to the intein active site as determined 
crytallographically (Duan, et al. f Cell, 89:555-564 (1997); 
Klabunde, et al., Nat Struct BioL, 5:31-36 (1998)), etc. The 
mutated intein gene is then introduced into a cell and examined 
for the ability to splice under permissive and non-permissive 
temperatures as chosen by the researcher, and can be any 
combination of temperatures (Figure 11). Splicing is assayed as 
in Examples I through III as long as the chromosomal or intein 
minus extein gene is not similarly temperature sensitive. 

Using the Mxe GyrA intein in the E.coli GyrA extein and 
expressing the fusion in E.coli cells (Example I), we have 
identified several polymerase chain reaction generated 
mutations that render splicing of the Mxe GyrA intein 
temperature sensitive (Figures 10, 11, 12 and 13). These 
precursors splice at 19°C, but not at 37°C. Moreover, these 
mutations concentrate in the beta-sheet that includes intein 
Block B (Figures 12 and 13). 

Screening for Temperature Sensitive Mxe GyrA Intein Mutants 

The gyrA selection system described in Example I, can also 
be used to screen for temperature sensitive splicing mutants of 
the Mxe GyrA intein in the ofloxacin sensitive E. coli GyrA extein. 
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Experiments were performed with a vector similar to pEA600. A 
splicing proficient clone and a splicing deficient clone (containing 
mutation of the intein Cys1 to Ala and Asm 98 to Ala) were 
plated on solid media containing various concentrations of 
ofloxacin to determine the appropriate drug concentration to 
allow growth of the splicing deficient clone while blocking growth 
of the splicing proficient clone. The Mxe gyrA intein gene was 
then amplified by PCR (Figure 10) using mutagenic strategies 
known to one skilled in the art and inserted into the E. coli gyrA 
gene. Libraries were plated on solid media containing ofloxacin at 
the predetermined concentration, replica plated and grown at 
either 37°C or 16°C (Figure 11). Only splicing defective clones 
survived and grew on the plates. The replica plates were 
compared to identify clones that grew at 37°C, but not at 16°C. 
Such clones were picked and retested for temperature 
dependent splicing. Alternatively, the libraries of mutated Mxe 
GyrA inteins in E. coli GyrA were grown at 37°C and then 
streaked onto a second plate to test for lack of growth at the 
splicing permissive temperature of 16°C. Splicing of the GyrA 
precursor was examined in clones that failed to grow at 16°C by 
incubating in the absence of ofloxacin at 37°C for 3 hours and 
then shifting to 16°C overnight. Cell lysates were 
electrophoresed in SDS-PAGE gels that were then stained with 
Coomassie blue. Spliced GyrA was observed in several clones, 
although splicing was not complete (Figure 12). 
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The Mxe gyrA intein gene was sequenced from several of 
these temperature sensitive clones and found to have one or 
more mutations which are summarized in Figure 13. The 3-D 
structure of the Mxe GyrA intein is known (Klabunde, et al., 
Nature Struct Biol. 5:31-36 (1998))., GyrA enabling us to place 
these mutations on the Mxe GyrA intein structure (Figure 14). 
We found that many of the mutations were in the beta-sheet 
including intein Block B (Figures 13 and 14), specifically in Mxe 
GyrA intein beta-strand B8 and the loop between beta-strands 
B8 and B9 (Klabunde, supra; Perler Cell 92:1-4 (1998)). Intein 
Block B contains conserved intein residues thought to assist in 
the autocatalytic reactions at the intein N-terminal splice 
junction (Klabunde, supra; Noren, C.J., et al. Angewandte Chemie 
(in press)). Mutation in residues proximal in space to intein Block 
B, as found in this selection for temperature sensitive Mxe GyrA 
intein mutants, may slightly perturb the position of Block B 
residues, resulting in the temperature sensitive phenotype. 

We suggest that mutation of the amino acids in the 
analogous beta-strand and loop in other inteins may generate 
temperature sensitive mutants of any intein. Homologous 
regions in other inteins can be easily identified due to the 
structural similarity of known intein splicing domains and intein 
multiple sequence alignments. To date, the 3-D structure of the 
Mxe GyrA intein (Klabunde, supra), the See VMA intein (Duan, et 
al., Cell 89:555-564 (1997)) and the Drosophila hedgehog protein 
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autoprocessing domain (Hall, et al. Cell, 91:85-97 (1997)) have 
been determined. The splicing domain of both inteins and the N- 
terminal part of the hedgehog autoprocessing domain have the 
same protein fold; the alpha carbon trace of most of the amino 
acids in each of these 3 structures are superimpossible 
(Klabunde, supra; Perler, supra (1998)). Intein amino acid 
sequence similarity comparisons have also been described in the 
literature (Perler supra (1997), Pietrokovski, supra(1994). 
Pietrokovski, Protein Sci. 7:64-71 (1998), Dalgaard, et al., J. 
Comp. Biol. 4:193-214 (1997)). 

Given the similarity in intein splicing domain structure and 
sequence, one skilled in the art should easily be able to identify 
regions in any intein that are analogous to the Mxe GyrA intein 
beta-strand B8 and the loop between beta-strands B8 and B9, 
and using this information, mutate this region to specifically 
generate temperature sensitive protein splicing mutants. 
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WHAT IS CLAIMED IS: 

1 . A positive genetic selection system employing a precursor 
comprising a native intein in its natural or homologous 

5 modifiable extein context for the screening of agents 

which inhibit or activate protein splicing, said selection 
system comprising: (1) a host cell which contains a first 
gene encoding a non-selectable form of a target enzyme, 
and (2) a second gene encoding a selectable form of said 

10 target enzyme which is dominantly cytotoxic upon 

interaction under predetermined selection conditions, said 
second gene containing an intein, wherein the inhibition or 
activation of said selectable form of said target enzyme 
by a given agent affects the viability of said host cell. 

15 

2. The positive genetic selection system of claim 1, wherein 
the activation of said selectable form of said target 
enzyme by a given agent results in the death of the host 
cell. 

20 

3. The positive genetic selection system of claim 1, wherein 
the inhibition of said selectable form of said target 
enzyme by a given agent results in the viability of the host 
cell. 



25 
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4. The positive genetic selection system of claim 1, wherein 
said host cell contains a first gene encoding a drug- 
resistant form of the target enzyme and a second gene 
encoding a drug-sensitive form of the target enzyme 

which is dominantly cytotoxic upon interaction with said 
drug. 



10 



15 



20 



5. The positive genetic selection system of claim 1, wherein 
said first gene encodes a wild type form of said target 
enzyme and said second gene encodes a dominant 
cytotoxic form of said target enzyme. 

6. The positive genetic selection system of claim 4, wherein 
in the absence of a silent mutation of said extein, said 
intein is selected from the group consisting of an intein 
inserted into the drug-sensitive form of said target 
enzyme and a natural intein in a mutated native or 
homologous extein, wherein said mutation renders the 
extein cytotoxic upon interaction with said drug. 



7. 



The positive genetic selection system of claim 1. wherein 
said agent comprises an in vivo peptide library or 
derivatives thereof. 



The positive genetic selection system of claim 4, wherein 
said intein comprises the M. xenopi GyrA intein, said 
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homologous extein comprises E.coli GyrA and said host cell 
comprises E.colL 

9. The positive genetic selection system of claim 8, wherein 
said drug-resistant form of said target enzyme is the 
Ser83 mutant of E.coli GyrA. 

10. The positive genetic selection system of claim 7, wherein 
said peptide library comprises a combinatorial peptide 
library in a fragment of chicken ce-spectrin. 

1 1 . The positive genetic selection system of claim 5, wherein 
in the absence of a silent mutation in said extein, said 
intein is selected from the group consisting of an intein 
inserted in to the cytotoxic form of said target enzyme 
and a natural intein in a mutated native or homologous 
extein, wherein said mutated native extein is cytotoxic. 

12. The positive genetic selection system of claim 11, wherein 
said agent comprises an in vivo peptide library or 
derivatives thereof. 

13. The positive genetic selection system of claim 11, wherein 
said natural intein is the M. tuberculosis DnaB intein and 
said mutated native extein is the M. tuberculosis R231C 
mutant, and wherein said host cell is E. coli. 
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. The positive genetic selection system of claim 13, wherein 
said in vivo peptide library comprises a combinatorial 
peptide library in a fragment of chicken cc-spectrin. 



A method of screening for agents which inhibit protein 
splicing, said method comprising the steps of: 

(a) creating a positive selection system comprising a host 
cell containing a gene encoding a dominantly cytotoxic 
protein containing an intein; and 

(b) culturing the host cell of step (a) in the presence of 
test agents, wherein the inhibition of splicing of said 
cytotoxic protein results in viability of said host cell. 



16. The method of claim 15, wherein said agent is expressed 
within the host cell as a protein or portion thereof and said 
agent is identified by the gene encoding said agent from 
said surviving host. 

17. The method of claim 15, wherein said positive selection 
system of step (a) comprises the positive genetic 
selection wherein said host cell contains a first gene 
encoding a drug-resistant form of the target enzyme and 
a second gene encoding a drug-sensitive form of the 
target enzyme which is dominantly cytotoxic upon 
interaction with said drug, and wherein step (b) further 
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comprises culturing said host cell of step (a) in the 
presence of said drug. 

18. The method of claim 15, wherein said positive selection 
system of step (a) comprises a first gene encoding a wild 
type form of said target enzyme and a second gene 
encoding a dominant cytotoxic form of said target 
enzyme. 

19. The method of claims 17 or 18 , wherein said host cell of 
step (a) expresses an in vivo peptide library or derivatives 
thereof, and wherein said test agents of step (b) comprise 
said in vivo peptide library or derivatives thereof. 

20. The method of claim 19 wherein said in vivo peptide library 
comprises a combinatorial peptide library in a fragment of 
chicken ot-spectrin. 

21. A method for screening of agents which activate protein 
splicing, said method comprising the steps of: 

(a) creating a positive selection system comprising a host 
cell containing a gene encoding a dominantly cytotoxic 
protein, said gene containing an inactive intein; and 

(b) culturing said host cell of step (a) in the presence of 
individual test agents, wherein the activation of splicing of 
said cytotoxic protein results in host cell death. 
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22. The method of claim 21, wherein said agent is controllably 
expressed within the host cell as a protein or portion 
thereof and said agent is identified by the identification of 
the gene encoding said agent in a parallel sample of said 
host cell in which expression of said agent was not 
activated. 

23. The method of claim 21, wherein said positive selection 
system of step (a) comprises the positive genetic 
selection system wherein said host cell contains a first 
gene encoding a drug-resistant form of the target enzyme 
and a second gene encoding a drug-sensitive form of the 
target enzyme which is dominantly cytotoxic upon 
interaction with said drug, and wherein step (b) further 
comprises culturing said positive selection system of step 
(a) in the presence of said drug. 

24. The method of claim 21, wherein said positive selection 
system of step (a) comprises the positive genetic 
selection system wherein the drug-resistant form of said 
target enzyme is the Ser83 mutant of E.coli GyrA. 

25. The method of claims 21 or 22, wherein said host cell of 
step (a) expresses an in vivo peptide library or derivatives 
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thereof, and wherein said test agents of step (b) comprise 
said in vivo peptide library or derivatives thereof. 



26. The method of claim 25, wherein said in vivo peptide 
library comprises a combinatorial peptide library in a 
fragment of chicken a-spectrin. 



10 



15 



27. A positive genetic selection, system for the screening of 
agents which inhibit protein splicing, said selection system 
comprising: (1) a host cell which contains a first gene 
encoding a controllable form of a target enzyme which is 
required for cell viability, and (2) a second gene encoding 
an expressed form of said target enzyme, said second 
gene containing an intein, wherein the inhibition of splicing 
of said target enzyme by a given agent results in the 
reduced viability or death of said host cell under conditions 
which do not permit the expression of said controllable 
first gene of said target enzyme. 



20 



25 



28. The selection system of claim 27, wherein said controllable 
form of said target protein is selected from the group 
consisting of a drug-sensitive target protein, an inducer- 
sensitive target protein, a temperature-sensitive target 
protein, and a target protein operably linked to a 
controllable promoter. 
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29. The selection system of claim 27, wherein said intein is 
selected from the group consisting of a foreign intein 
inserted into the homologous extein of said target enzyme 
and a natural intein in a native extein. 

30. A positive genetic selection system for the screening of 
agents which activate protein splicing, said selection 
system comprising: (1) a host cell which contains a first 
gene encoding a controllable form of a target enzyme 
required for cell viability, and (2) a second gene encoding 
an expressed form of said target enzyme, said second 
gene containing an inactive intein, wherein the activation 
of splicing of said target enzyme by a given agent results 
in the survival of said host cell under conditions which do 
not permit the expression of said controllable first gene of 
said target enzyme. 



31. The selection system of claim 30, wherein said controllable 
target protein is selected from the group consisting of a 
drug-sensitive target protein, an inducer-sensitive target 
protein, a temperature-sensitive target protein, and a 
target protein operably linked to a controllable promoter. 

32. The selection system of claim 30, wherein said inactive 
intein is selected from the group consisting of a foreign 
inactive intein inserted into the homologous extein of said 
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target enzyme and a natural inactive intein in a native 
extein. 

33. The method of claims 27 or 30, wherein said host cell of 
step (a) expresses an in vivo peptide library or derivatives 
thereof, and wherein said test agents of step (b) comprise 
said in vivo peptide library or derivatives thereof. 

34. The method of claim 33, wherein said in vivo peptide 
library comprises a combinatorial peptide library in a 
fragment of chicken a-spectrin. 

35. A method of screening for agents which inhibit protein 
splicing, said method comprising the steps of: 

(a) culturing the positive selection system of claim 27 in 
the presence of test agents under non-permissive 
conditions; and 

(b) identifying non-surviving host cells from step (a), 
wherein said agent inhibits protein splicing. 

36. The method of claim 35, wherein said agent is expressed 

within the host cell as a protein or portion thereof and 
wherein said non-surviving host cells of step (b) contain 
test agents which inhibit protein splicing. 



WO 01/32831 



PCT/USOO/29596 



- 91 - 



37. A method of screening for agents which activate protein 
splicing, said method comprising the steps of: 

(a) culturing the positive selection system of claim 30 in 
the presence of test agents under non-permissive 
conditions; and 

(b) identifying surviving host cells from step (a), wherein 
said agent activates protein splicing. 

38. The method of claim 37, wherein said agent is expressed 

within the host cell as a protein or portion thereof and 
wherein said surviving host cells of step (b) contain test 
agents which activate protein splicing. 

39. The method of claims 36 or 38, wherein said host cell of 
step (a) expresses an in vivo peptide library or derivatives 
thereof, and wherein said test agents of step (a) comprise 
said in vivo peptide library or derivatives thereof. 

40. The method of claim 39, wherein said in vivo peptide 
library comprises a combinatorial peptide library in a 
fragment of chicken a-spectrin. 



41 



A reporter system for the screening of agents which 
inhibit protein splicing, said reporter system comprising a 
host cell which contains a reporter gene encoding a non- 
essential protein, said reporter gene containing an intein, 
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wherein said intein is selected from the group consisting 
of a foreign intein inserted into the homologous or non- 
homologous extein of said reporter gene in the absence of 
a silent mutation of said extein and a natural intein in a 
native reporter extein, and wherein the inhibition of 
splicing of said non-essential protein by a given agent 
results in a specific detectable phenotype of said host 
cell. 

42. A reporter system for the screening of agents which 
activate protein splicing, said reporter system comprising 
a host cell which contains a reporter gene encoding a non- 
essential protein, said reporter gene containing an inactive 
intein, wherein said inactive intein is selected from the 
group consisting of a foreign inactive intein inserted into 
the homologous or non-homologous extein of said reporter 
gene in the absence of a silent mutation of said extein and 
a natural inactive intein in a native reporter extein, and 
wherein the activation of splicing of said non-essential 
protein by a given agent results in a specific selectable 
phenotype of said host cell. 

43. The method of screening for agents which inhibit protein 

splicing, said method comprising the steps of: 

(a) culturing the reporter system of claim 41 in the 

presence of test agents; and 
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(b) identifying host cells from step (a) having a specific 
detectable phenotype, wherein said host cells with 
detectable phenotype are in the presence of test agents 
which inhibit protein splicing. 

The method of claim 43 wherein said agent is expressed 
within the host cell as a protein or portion thereof and 
wherein step (b) comprises identifying host cells from 
step (a) having a specific detectable phenotype, wherein 
said host cells with detectable phenotype contain test 
agents which inhibit protein splicing. 



A method of screening for agents which activate protein 
splicing, said method comprising the steps of: 

(a) culturing the reporter system of claim 42 in the 
presence of test agents; and 

(b) identifying host cells from step (a) having a specific 
detectable phenotype, wherein said host cells with 
detectable phenotype are in the presence of test agents 
which activate protein splicing. 



The method of claim 45 wherein said agent is expressed 
within the host cell as a protein or portion thereof and 
wherein step (b) comprises identifying host cells from 
step (a) having a specific detectable phenotype, wherein 
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said host cells with detectable phenotype contain test 
agents which activate protein splicing. 

47. The method of claims 44 or 46, wherein said host cell of 
step (a) expresses an in vivo peptide library or derivatives 
thereof, and wherein said test agents of step (a) comprise 
said in vivo peptide library or derivatives thereof. 

48. The method of claim 47 f wherein said in vivo peptide 
library comprises a combinatorial peptide library in a 
fragment of chicken cc-spectrin. 

49. A method of controlling gene expression in vivo, said 
method comprising the steps of: 

(a) replacing a homologous extern gene in a host cell with a 
gene containing an intein; and 

(b) modulating the splicing of the intein-containing gene of 
step (a) with agents which inhibit or activate said splicing. 

50. The method of claim 49, wherein said replacement of step 
(a) comprises inserting said intein gene into said 
homologous extein gene by homologous recombination. 

51. The method of claim . 49, wherein said replacement of step 
(a) comprises inactivating said homologous extein gene and 
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said intein-containing gene is introduced as a second gene 
in the cell. 

52. A method of controlling the delivery of a drug that is at 
least partially proteinaceous In vivo, said method 
comprising the steps of: 

(a) inserting an intein into the protein portion of said drug 
to create an inactive drug; 

(b) administering said inactive drug of step (a); and 

(c) activating protein splicing of said inactive drug to 
produce an active drug. 

The method of claim 52, wherein step (b) further 
comprises utilizing gene therapy to target said inactive 
drug to a desired tissue. 

The method of claim 52, wherein said intein of step (a) 
comprises a temperature-sensitive intein, and wherein said 
activation of step (c) comprises exposing a desired target 
tissue to any treatment which increases or decreases 
temperature of said target tissue thus inducing splicing of 
said temperature-sensitive intein. 



53. 



54. 



55. 



The method of claim 52, wherein said activation of step 
(c) comprises injecting a desired target tissue with an 
agent which activates splicing. 
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56. The method Of claim 52, wherein step (a) further 

comprises utilizing gene therapy to target an agent which 
activates splicing to a desired tissue, and wherein said 
administration of step (b) comprises systemic 
administration. 



57. The method of claim 52, wherein said administration of 
step (b) comprises systemic transformation with a non- 
cell specific vector containing said inactive drug of step 
(a) operably linked to a desired tissue-specific promoter, 
and wherein said inactive drug is expressed only in cells 
which can activate said tissue-specific promoter. 

58. A method for generating temperature sensitive mutants 
of the Mxe GyrA intein in E.coli GyrA, said method 
comprising the steps of: 

(a) identification of the region containing the Mxe GyrA 
intein beta-strand B8 and the loop between Beta-strands 
B8 and B9 in E.coll GyrA; 

(b) mutating said region; and 

(c) introducing the mutated intein gene into a cell and 
examining the ability to splice under permissive and non- 
permissive temperatures. 
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59. A method for generating temperature sensitive mutants 
of an intein, said method comprising the steps of: 

(a) identification of a region homologous to the Mxe GyrA 
intein beta-strand B8 and the loop between beta-strands 
B8 and B9 in an intein; 

(b) mutating said homologous region of said second intein; 
and 

(c) introducing the mutated intein gene into a cell and 
examining the ability to splice under permissive and non- 
permissive temperatures. 

60. A method of screening for temperature-sensitive inteins, 
said method comprising the steps of 

(a) creating a positive selection system comprising a host 
cell containing a gene encoding a dominantly cytotoxic 
protein, said gene containing an intein, and wherein said 
intein is mutagenized; and 

(b) culturing the host cell of step (a) at a range of 
temperatures, wherein at a predetermined temperature, 
the protein fails to splice and results in viability of said 
host cell. 

61. A method of screening for temperature-sensitive inteins, 
said method comprising the steps of 

(a) creating a positive selection system comprising a host 
cell containing a gene encoding a dominantly cytotoxic 
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protein, said gene containing an inactive intein, and wherein 
said intein is mutagenized; and 
(b) culturing the host cell of step (a) at a range of 
temperatures, wherein at a predetermined temperature, 
5 the intein splices and results in the death of the host cell. 

62. A positive selection system for the screening of 
temperatures which inhibit protein splicing in temperature- 
sensitive inteins, said selection system comprising (1) a 

10 host cell which contains a first gene encoding a 

controllable form of a target enzyme which is required for 
cell viability, and (2) a second gene encoding an expressed 
form of said target enzyme, said second gene containing 
an intein, wherein said intein is mutagenized, and wherein 

15 the inhibition of splicing of said target enzyme by a given 

non-permissive temperature results in the reduced 
viability or death of said host cell under conditions which 
do not permit the expression of said controllable first 
gene of said target enzyme. 

20 

63. A positive selection system for the screening of 
temperatures which activate protein splicing in 
temperature-sensitive inteins, said selection system 
comprising (1) a host cell which contains a first gene 

25 encoding a controllable form of a target enzyme which is 

required for cell viability, and (2) a second gene encoding 
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an expressed form of said target enzyme, said second 
gene containing an intein, wherein said intein is 
mutagenized, and wherein the activation of splicing of said 
target enzyme by a given permissive temperature results 
in the viability of said host cell under conditions which do 
not permit the expression of said controllable first gene of 
said target enzyme. 

64. A method of screening for temperatures which inhibit 
protein splicing in temperature-sensitive inteins, said 
method comprising the steps of: 

(a) culturing the positive selection system of claim 62 in 
the presence of non-permissive temperatures; and 

(b) identifying non-surviving host cells from step (a), 
wherein said temperature inhibits protein splicing. 

65. A method of screening for temperatures which activate 
protein splicing in temperature-sensitive inteins, said 
method comprising the steps of: 

(a) culturing the positive selection system of claim 63 in 
the presence of permissive temperatures; and 

(b) identifying surviving host cells from step (a), wherein 
said temperature activates protein splicing. 

66. A reporter system for the screening of temperatures 
which inhibit protein splicing in temperature-sensitive 
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inteins., said reporter system comprising a host cell which 
contains a reporter gene encoding a non-essential protein, 
said reporter gene containing an intein, wherein said intein 
is selected from the group consisting of a foreign intein 
5 inserted into the homologous or non-homologous extein of 

said reporter gene in the absence of a silent mutation of 
said extein and a natural intein in a native reporter extein, 
and wherein said intein is mutagenized, and wherein the 
inhibition of splicing of said non-essential protein by a non- 
10 permissive temperature results in a specific detectable 

phenotype. 



67. A reporter system for the screening of temperatures 
which activate protein splicing in temperature-sensitive 

15 inteins, said reporter system comprising a host cell which 

contains a reporter gene encoding a non-essential protein, 
said reporter gene containing an intein, wherein said intein 
is selected from the group consisting of a foreign intein 
inserted into the homologous or non-homologous extein of 

20 , said reporter gene in the absence of a silent mutation of 

said extein and a natural intein in a native reporter extein, 
and wherein said intein is mutagenized, and wherein the 
activation of splicing of said non-essential protein by a 
permissive temperature results in a specific detectable 

25 phenotype. 
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68. A method for screening for temperatures which inhibit 
protein splicing in temperature-sensitive inteins, said 
method comprising the steps of: 

(a) culturing the reporter system of claim 66 in the 
presence of a range of temperatures; and 

(b) identifying host cells from step (a) having a specific 
detectable phenotype, wherein said host cells with 
detectable phenotype are in the presence non-permissive 
temperatures which inhibit protein splicing. 



69. A method for screening for temperatures which activate 
protein splicing in temperature-sensitive inteins, said 
method comprising the steps of: 

(a) culturing the reporter system of claim 67 in the 
presence of a range of temperatures: and 

(b) identifying host cells from step (a) having a specific 
detectable phenotype, wherein said host cells with 
detectable phenotype are in the presence permissive 
temperatures which activate protein splicing. 

70. The positive genetic selection system of claim 6, 11, 29 < 
32, wherein the extein is a heterologous extein and the 
intein is inserted into the extein at a site which is 
substantially identical to the homologous extein from 
about one to about five amino acid residues at either or 
both ends of the intein. 
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71. The reporter system of claim 41, wherein the extein is a 
heterologous extein and the intein is inserted into the 
extein at a site which is substantially identical to the 
homologous extein from about one to five amino acid 
residues at either or both ends of the intein. 

72. The positive selection system of claim 6, 11, 29 or 32, 
wherein the extein is a heterologous extein and one to five 
amino acid residues of the native extein are present at 
one or both ends of the intein and said one to five amino 
acid residues are inserted into the heterologous extein 
along with the intein. 

73. The reporter system of claim 41, wherein the extein is a 
heterologous extein and one to five amino acid residues of 
the native extein are present at one or both ends of the 
intein and said one to five amino acid residues are inserted 
into the heterologous extein along with the intein. 

74. The positive genetic selection system of claims 6, 11, 29 
or 32, wherein the insertion site is selected from the 
group consisting essentially of a surface location on the 
extein, a loop region of the extein, a protease sensitive 
site within the extein, or at a position known to permit 
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insertion of one or more amino acid residues in the extein 
without inactivating the extein. 



75. The positive genetic selection system of claim 70, wherein 

m 

the insertion site is selected from the group consisting 
essentially of a surface location on the extein, a loop 
region of the extein, a protease sensitive site within the 
extein, or at a position known to permit insertion of one or 
more amino acid residues in the extein without inactivating 
the extein. 



76. The positive genetic selection system of claim 72, wherein 
the insertion site is selected from the group consisting 
essentially of a surface location on the extein, a loop 
region of the extein, a protease sensitive site within the 
extein, or at a position known to permit insertion of one or 
more amino acid residues in the extein without inactivating 
the extein. 

77. The reporter system of claim 41 or 71, wherein the 
insertion site is selected from the group consisting 
essentially of a surface location on the extein, a loop 
region of the extein, a protease sensitive site within the 
extein, or at a position known to permit insertion of one or 
more amino acid residues in the extein without inactivating 
the extein. 
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78. A positive genetic selection system for screening of 
agents which inhibit protein splicing, said selection system 
comprising: 

5 i) a host which contains a first gene encoding an 

inactivated form of a target enzyme which is required for 
cell viability under predetermined conditions; and 
ii) a second gene encoding an expressed form of said 
target enzyme, said second gene containing an intern, 

10 wherein the inhibition of splicing of said target enzyme by 

a given agent results in the reduced viability of said host 
cell under said predetermined conditions wherein 
expression of said target enzyme is required for viability 
or growth. 

15 

79. A positive selection system for the screening of 
temperatures which inhibit protein splicing in temperature 
sensitive inteins, said selection system comprising: 

i) a host cell which contains a first gene encoding an 
20 inactivated form of the target enzyme which is required 

for cell viability under predetermined conditions; and 

ii) a second gene encoding an expressed form of said 
target enzyme, said second gene containing an intein, 
wherein said intein is mutagenized and wherein the 

25 inhibition of splicing of said target enzyme at one or more 

predetermined temperatures results in the reduced 
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viability of said host cell under said predetermined 
conditions wherein expression of said target enzyme is 
required for viability or growth. 

80. A method of identifying an agent for antimicrobial activity 
against a microbial pathogen that naturally has an intein in 
an essential gene comprising screening for agents that 
block splicing of that intein in its native context or a 
homologous extein context. 

81. A method of identifying an agent for antimicrobial activity 
against a microbial pathogen that naturally has an intein in 
an essential gene comprising screening for agents that 
block splicing of that intein in a heterologous extein 
context that includes one or more native extein residues 
flanking one or both ends of the intein. 

82. A method of identifying agents with antimicrobial activity 
against Mycobacterium tuberculosis comprising screening 
for agents that inhibit splicing of the Mycobacterium 
tuberculosis DnaB intein. 

83. A method of identifying agents with antimicrobial activity 
against Mycobacterium leprae comprising screening for 
agents that inhibit splicing of the Mycobacterium xenopi or 
Mycobacterium leprae GyrA inteins. 
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84. A method of identifying lead compounds with antimicrobial 
activity comprising identifying agents that inhibit splicing 
of an intein which is naturally present in that organism. 
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FIG. 1 

A TYPICAL PROTEIN SPLICING PRECURSOR AND PROOUCTS 
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FIG. 2 
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FIG. 3A 
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on LB agar ♦ Drug (Ofloxacin) 
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Plate E. coli transformants 
on LB agar ♦ Drug (Ofloxacin) 
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FIG. 4B 
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FIG. 5B 
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FIG. 5C 




Anti-T7 tag Western blot (T7 Tag at N-terminus of Mtu OnaB extein). 

Black Boxes = extein, white box = intein. Top band = precursor. 

middle band ■ C-terminal cleavage product, bottom band = spliced exteins 
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FIG. 7 



The effect of the single amino acid preceding the Mxe GyrA intein in a 
heterologous extein context on splicing and N-terminal cleavage by OTT . 
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a The Mxe GyrA intein was inserted between the maltose binding protein 
and a fragment of Oirofilaria immitis paramyosin (Telenti, et. al., (1997). 
J Bacteriol 179. G37B-B2; Southworth. et, aL. (1999). BioTechniques (in 
press)). This fusion protein was expressed in E. coli at 19°C to examine 
splicing or at 37°C (a non-permissive temperature for splicing of this 
precursor) to examine activation of the N-terminal splice junction for 
cleavage by DTT (dithiothreitol) . Samples expressed at 37°C were 
incubated at 19°C in the presence of 50 mM OTT and (1) the rate of OTT 
cleavage at the N-terminal splice junction was quantitated as a percent 
of the DTT cleavage rate of each precursor compared to the precursor 
with the wild type -1 amino acid (Tyr) preceding the intein or (2) the 
amount of cleaved product after an overnight incubation in DTT. (ND = 
not determined) . 
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FIG. 8 
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FIG. 12 

TEHPERATURE SENSITIVE Mxe GyrA IMTEIN KUTANTS 
ONLY SPLICE WHEN SHIFTEO TO LOWER TEMPERATURE 



_ vrt 

A 




It. 




Spliced GyrA intein[ 



1 - 




mutants 



i 




stffi 




r 




Precursor 

Spliced GyrA extein 



IS."* 



.-.«.-» 



Lanes A : Induction at nonpermissive temperature (37 °C) . 3 hours. 
Other lanes : Induction 37 "C. 3 hours, then shift to IB °C overnight 
Black Boxes : E. coli 6yrA extein. White Box : Hxe GyrA intein 
Cooraassie blue stained gel. 
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FIG. 13 
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Temperature sensitive mutations are indicated below the wild type sequence 
Lower case letters indicate two amino acid mutations in one clone. 
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SEQUENCE LISTING 



<110> NEW ENGLAND BIOLABS, INC. 

<120> Screening Ana Use Of Reagents Which Block Or Activate 
Intein Splicing Utilizing Natural Or Homologous Extei: 

<130> NEB-155-PCT 



<140> 
<141> 



<150> 09/430,221 
<151> 1999-10-29 

<160> 46 

<170> Patentln Ver. 2.0 



<210> 1 
<211> 186 
<212> PRT 

<213> Escherichia coli Gyrase A 
<400> 1 

Met Ser Asp Leu Ala Arg Glu lie Thr Pro Val Asn He Glu Glu Glu 

Leu Lys Ser Ser Tyr Leu Asp Tyr Ala Met Ser Val He Val Gly Arg 

20 25 3 0 

Ala Leu Pro Asp Val Arg Asp Gly Leu Lys Pro Val His Arg Arg Val 
35 40 45 

Leu Tyr Ala Met Asn Val Leu Gly Asn Asp Trp Asn Lys Ala Tyr Lys 
50 55 60 

Lys Ser Ala Arg Val Val Gly Asp Val He Gly Lys Tyr His Pro His 
65 7 ° 75 80 

Gly Asp Ser Ala Val Tyr Asp Thr He Val Arg Met Ala Gin Pro Phe 

85 90 95 

Ser Leu Arg Tyr Met Leu Val Asp Gly Gin Gly Asn Phe Gly Ser He 

100 105 110 

Asp Gly Asp Ser Ala Ala Ala Met Arg Tyr Thr Glu He Arg Leu Ala 
H5 120 125 



1 
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Lys lie Ala His Glu Leu Met Ala Asp Leu Glu Lys Glu Thr Val Asp 
130 135 140 

Phe Val Asp Asn Tyr Asp Gly Thr Glu Lys lie Pro Asp Val Met Pro 
145 150 155 160 

Thr Lys lie Pro Asn Leu Leu Val Asn Gly Ser Ser Gly lie Ala Val 

165 170 175 

Gly Met Ala Thr Asn lie Pro Pro His Asn 

180 185 



<210> 2 
<211> 127 
<212> PRT 

<213> Partial Mycobacterium xenopi GyrA 
<400> 2 

Asp Arg Ser His Ala Lys Ser Ala Arg Ser Val Ala Glu Thr Met Gly 
1 5 10 15 

Asn Tyr His Pro His Gly Asp Ala Ser lie Tyr Asp Thr Leu Val Arg 

20 25 30 



Met Ala Gin Pro Trp 
35 

Asn Phe Gly Ser Pro 
50 

Glu Ala Pro Leu Thr 
65 

Glu Glu Thr Val Asp 

85 



Ser Met Arg Tyr Pro Leu 
40 

Gly Asn Asp Pro Pro Ala 
55 

Pro Leu Ala Met Glu Met 
70 75 

Phe lie Pro Asn Tyr Asp 

90 



Val Asp Gly Gin Gly 
45 

Ala Met Arg Tyr Thr 
60 

Leu Arg Glu lie Asp 

80 

Gly Arg Val Gin Glu 

95 



Pro Thr Val Leu Pro Ser Arg Phe 

100 

Gly Gly lie Ala Val Gly Met Ala 
115 120 



Pro Asn Leu Leu Ala Asn Gly Ser 
105 110 

Thr Asn lie Pro Pro His Asn 

125 



<210> 3 
<211> 438 
<212> PRT 



2 
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<213> Escherichia coli DnaB 
<400> 3 

Pro Pro His Ser lie Glu Ala Glu Gin Ser Val Leu Gly Gly Leu Met 
1 5-10 is 

Leu Asp Asn Glu Arg Trp Asp Asp Val Ala Glu Arg Val Val Ala Asp 

20 25 30 

Asp Phe Tyr Thr Arg Pro His Arg His He Phe Thr Glu Met Ala Arg 
35 40 45 

Leu Gin Glu Ser Gly Ser Pro He Asp Leu He Thr Leu Ala Glu Ser 
50 55 60 

Leu Glu Arg Gin Gly Gin Leu Asp Ser Val Gly Gly Phe Ala Tyr Leu 
65 70 75 80 

Ala Glu Leu Ser Lys Asn Thr Pro Ser Ala Ala Asn He Ser Ala Tyr 

85 90 95 

Ala Asp He Val Arg Glu Arg Ala Val Val Arg Glu Met He Ser Val 

100 105 no 

Ala Asn Glu He Ala Glu Ala Gly Phe Asp Pro Gin Gly Arg Thr Ser 
115 120 125 

Glu Asp Leu Leu Asp Leu Ala Glu Ser Arg Val Phe Lys He Ala Glu 
130 135 140 

Ser Arg Ala Asn Lys Asp Glu Gly Pro Lys Asn He Ala Asp Val Leu 
145 150 155 iso 

Asp Ala Thr Val Ala Arg He Glu Gin Leu Phe Gin Gin Pro His Asp 

165 ivo 175 

Gly Val Thr Gly Val Asn Thr Gly Tyr Asp Asp Leu Asn Lys Lys Thr 

180 185 190 

Ala Gly Leu Gin Pro Ser Asp Leu He He Val Ala Ala Arg Pro Ser 
195 200 205 

Met Gly Lys Thr Thr Phe Ala Met Asn Leu Val Glu Asn Ala Ala Met 
210 215 220 

Leu Gin Asp Lys Pro Val Leu He Phe Ser Leu Glu Met Pro Ser Glu 
225 230 235 240 



3 
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Gin lie Met Met Arg Ser Leu Ala Ser 

245 

Lys lie Arg Thr Gly Gin Leu Asp Asp 

260 265 
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Leu Ser Arg Val Asp Gin Thr 
250 , 255 

Glu Asp Trp Ala Arg He Ser 

270 



Gly Thr Met Gly 
275 

Ser Ser Gly Leu 
290 - 

Ala Arg Glu His 
305 



He Leu Leu Glu 

280 

Thr Pro Thr Glu 
295 

Gly Gly He Gly 
310 



Lys Arg Asn He 



Val Arg Ser Arg 

300 

Leu He Met He 
315 



Tyr He Asp Asp 
285 

Ala Arg Arg He 



Asp Tyr Leu Gin 

320 



Leu Met Arg Val 



Glu He Ser Arg 

340 



Pro Ala Leu Ser 
325 

Ser Leu Lys Ala 



Asp Asn Arg Thr 
330 

Leu Ala Lys Glu 
345 



Leu Glu He Ala 
335 

Leu Asn Val Pro 
350 



Val Val Ala Leu Ser Gin Leu Asn Arg Ser Leu Glu Gin Arg Ala Asp 
355 360 365 

Lys Arg Pro Val Asn Ser Asp Leu Arg Glu Ser Gly Ser lie Glu Gin 
370 375 380 

Asp Ala Asp Leu He Met Phe He Tyr Arg Asp Glu Val Tyr His Glu 
385 390 395 400 



Asn Ser Asp Leu Lys Gly He Ala Glu He He He Gly Lys Gin Arg 

405 410 415 

Asn Gly Pro He Gly Thr Val Arg Leu Thr Phe Asn Gly Gin Trp Ser 

420 425 430 

Arg Phe Asp Asn Tyr Ala 
435 



<210> 4 
<211> 434 
<212> PRT 

<213> Partial Mycobacterium tuberculosis DnaB 
<400> 4 

Pro Pro Gin Asp Leu Ala Ala Glu Gin Ser Val Leu Gly Gly Met Leu 
15 10 15 



4 
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Leu Ser Lys Asp 

20 

Asp Phe Tyr Arg 
35 

Leu Tyr Gly Arg 
50 

Leu Asp Arg Arg 
65 

His Thr Leu lie 



Ala Ser lie Val 

100 

Gly Thr Arg Val 
115 

Val Ala Glu val 
130 

Asp Arg Arg Leu 
145 

Pro Thr Met Asp 



Arg Gly Val Ala 

180 

Leu His Pro Gly 
195 

Lys Ser Thr Leu 
210 

Arg Met Ala Ser 
225 

Val Met Arg Leu 



Ala lie Ala Asp 



Pro Ala His Gin 

40 

Gly Glu Pro Ala 
55 

Gly Leu Leu Arg 
70 

Ser Thr Val Pro 
85 

Ala Glu Lys Ala 



Val Gin Tyr Gly 

120 

Val Asp Arg Ala 
135 

Ser Glu Asp Phe 
150 

Glu He Asp Ala 
165 

Thr Gly Phe Thr 



Gin Met Val He 

200 

Gly Leu Asp Phe 
215 

Val He Phe Ser 
230 

Leu Ser Ala Glu 
245 



Val Leu Glu Arg 
25 

Asn Val Tyr Asp 



Asp Ala Val Thr 

60 

Arg He Gly Gly 
75 

Thr Ala Ala Asn 
90 

Leu Leu Arg Arg 
105 

Tyr Ala Gly Ala 

Gin Ala Glu He 

140 

Val Ala Leu Glu 
155 

He Ala Ser Ser 
170 

Glu Leu Asp Glu 
185 

Val Ala Ala Arg 

Met Arg Ser Cys 

220 

Leu Glu Met Ser 
235 

Ala Lys He Lys 
250 



Leu Arg Pro Gly 
30 

Ala He Leu Asp 
45 

Val Ala Ala Glu 



Ala Pro Tyr Leu 

80 

Ala Gly Tyr Tyr 
95 

Leu Val Glu Ala 
110 

Glu Gly Ala Asp 
125 

Tyr Asp Val Ala 



Asp Leu Leu Gin 

160 

Gly Gly Leu Ala 
175 

Val Thr Asn Gly 
190 

Pro Gly Val Gly 
205 

Ser He Arg His 



Lys Ser Glu He 

240 

Leu Ser Asp Met 
255 



Arg Ser Gly Arg Met Ser Asp Asp Asp Trp Thr Arg Leu Ala Arg Arg 

260 265 270 
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Met Ser Glu lie 
275 

Leu Thr Met Met 
290 

Ala Asn Leu Lys 
305 

Gly Lys Lys Tyr 



Ser Glu Ala Pro 

280 

Glu lie Arg Ala 
295 

Leu He Val Val 
310 

Glu Ser Arg Gin 
325 



Leu Phe He. Asp 



Lys Ala Arg Arg 

300 

Asp Tyr Leu Gin 
315 

Val Glu Val Ser 
330 



Asp Ser Pro Asn 
285 

Leu Arg Gin Lys 



Leu Met Thr Ser 

320 

Glu Phe Ser Arg 
335 



His Leu Lys Leu Leu Ala Lys Glu 

340 

Ser" Gin Leu Asn Arg Gly Pro Glu 
355 360 



Leu Glu Val Pro Val Val Ala He 
345 350 

Gin Arg Thr Asp Lys Lys Pro Met 

365 



Leu Ala Asp Leu Arg 
370 

Val He Leu Leu His 
385 

Gly Gly Glu Ala Asp 

405 

Lys Thr Val Thr Val 

420 

Met Ala 



Glu Ser Gly Ser Leu Glu 
375 

Arg Pro Asp Ala Phe Asp 
390 395 

Phe He Leu Ala Lys His 

410 

Ala His Gin Leu His Leu 

425 



Gin Asp Ala Asp Val 
380 

Arg Asp Asp Pro Arg 

400 

Arg Asn Gly Pro Thr 

415 

Ser Arg Phe Ala Asn 
430 



<210> 5 
<211> 7 
<212> PRT 

<213> Mycobacterium tuberculosis DnaB 
<400> 5 

Thr Val Gin Ser Thr Lys Arg 
1 5 



<210> 6 
<211> 7 
<212> PRT 
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<213> Mycobacterium tuberculosis DnaB 
<400> 6 

Arg Pro Ala Pro Arg Pro Leu 
15 

<210> 7 
<211> 7 
<212> PRT 

<213> Mycobacterium tuberculosis DnaB 
<400> 7 

Pro Thr Ala Arg Thr Tyr Glu 
1 5 

<210> 8 
<211> 12 
<212> PRT 

<213> Mycobacterium tuberculosis DnaB 
<400> 8 

Pro Thr Arg Pro Thr Ala Pro Pro Leu Asn Phe Ser 
1 5 10 



<210> 9 
<211> 12 
<212> PRT 

<213> Mycobacterium tuberculosis DnaB 
<400> 9 

His Pro Asn Pro His Pro Thr Leu Ser Gly Gin Arg 
1 5 10 



<210> 10 
<211> 7 
<212> PRT 

<213> Mycobacterium tuberculosis DnaB 
<400> 10 

Asp Leu Pro Met Val Glu Glu 
1 5 



<210> 11 



WO 01/32831 
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<211> 198 
<212> PRT 

<213> Mycobacterium xenopi Gyrase A intein 



<400> 11 

Cys He Thr Gly Asp Ala Leu Val Ala Leu Pro Glu Gly Glu Ser Val 
15 10 15 

Arg He Ala Asp He Val Pro Gly Ala Arg Pro Asn Ser Asp Asn Ala 

20 25 30 

He Asp Leu Lys Val Leu Asp Arg His Gly Asn Pro Val Leu Ala Asp 
35 40 45 

Arg Leu Phe His Ser Gly Glu His Pro Val Tyr Thr Val Arg Thr Val 
50 55 60 

Glu Gly Leu Arg Val Thr Gly Thr Ala Asn His Pro Leu Leu Cys Leu 
65 70 75 80 

Val Asp Val Ala Gly Val Pro Thr Leu Leu Trp Lys Leu He Asp Glu 

85 90 95 

He Lys Pro Gly Asp Tyr Ala Val He Gin Arg Ser Ala Phe Ser Val 

100 105 110 

Asp Cys Ala Gly Phe Ala Arg Gly Lys Pro Glu Phe Ala Pro Thr Thr 
115 120 125 

Tyr Thr Val Gly Val Pro Gly Leu Val Arg Phe Leu Glu Ala His His 
130 135 140 

Arg Asp Pro Asp Ala Gin Ala He Ala Asp Glu Leu Thr Asp Gly Arg 
145 150 155 160 

Phe Tyr Tyr Ala Lys Val Ala Ser Val Thr Asp Ala Gly Val Gin Pro 

165 170 175 

Val Tyr Ser Leu Arg Val Asp Thr Ala Asp His Ala Phe He Thr Asn 

180 185 190 

Gly Phe Val Ser His Asn 
195 



<210> 12 
<211> 85 
<212> PRT 



8 
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<213> Gallus gallus alpha-spec trin fragment 
<400> 12 

Met Arg Asn Thr Thr Gly Val Thr Glu Glu Ala Leu Lys Glu Phe Ser 
1 5 io 15 

Met Met Phe Lys His Phe Asp Lys Asp Lys Ser Gly Arg Leu Asn His 

20 25 30 

Gin Glu Phe Lys Ser Cys Leu Arg Ser Leu Gly Tyr Asp Leu Pro Met 
35 40 45 

Val Glu Glu Gly Glu Pro Asp Pro Glu Phe Glu Ser lie Leu Asp Thr 
50 55 60 

Val Asp Pro Asn Arg Asp Gly His Val Ser Leu Gin Glu Tyr Met Ala 

65 70 -7C 

/u 75 30 

Phe Met lie Ser Arg 

85 



-•a 

■ 

■ 



a. 



<210> 13 
<211> 416 
<212> PRT 

<213> Mycobacterium tuberculosis DnaB intein 
<400> 13 

Cys Leu Thr Ala Ser Thr Arg He Leu Arg Ala Asp Thr Gly Ala Glu 

Val Ala Phe Gly Glu Leu Met Arg Ser Gly Glu Arg Pro Met Val Tro 

20 25 30 

Ser Leu Asp Glu Arg Leu Arg Met Val Ala Arg Pro Met He Asn Val 
35 40 45 

Phe Pro Ser Gly Arg Lys Glu Val Phe Arg Leu Arg Leu Ala Ser Gly 
50 55 60 

Arg Glu Val Glu Ala Thr Gly Ser His Pro Phe Met Lys Phe Glu Gly 
65 70 75 so 

Trp Thr Pro Leu Ala Gin Leu Lys Val Gly Asp Arg He Ala Ala Pro 

85 90 95 

Arg Arg Val Pro Glu Pro lie Asp Thr Gin Arg Met Pro Glu Ser Glu 

100 105 xl0 



9 
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Leu lie Ser Leu Ala Arg Met lie Gly Asp Gly Ser Cys Leu Lys Asn 
115 120 125 

Gin Pro lie Arg Tyr Glu Pro Val Asp Glu Ala Asn Leu Ala Ala Val 
130 135 140 

Thr Val Ser Ala Ala His Ser Asp Arg Ala Ala lie Arg Asp Asp Tyr 
145 150 155 160 

Leu Ala Ala Arg Val Pro Ser Leu Arg Pro Ala Arg Gin Arg Leu Pro 

165 170 175 

Arg Gly Arg Cys Thr Pro lie Ala Ala Trp Leu Ala Gly Leu Gly Leu 

180 185 190 

Phe Thr Lys Arg Ser His Glu Lys Cys Val Pro Glu Ala Val Phe Arg 
195 200 205 

Ala Pro Asn Asp Gin Val Ala Leu Phe Leu Arg His Leu Trp Ser Ala 
210 215 220 

Gly Gly Ser Val Arg Trp Asp Pro Thr Asn Gly Gin Gly Arg Val Tyr 
225 230 235 240 

Tyr Gly Ser Thr Ser Arg Arg Leu lie Asp Asp Val Ala Gin Leu Leu 

245 250 255 

Leu Arg Val Gly lie Phe Ser Trp lie Thr His Ala Pro Lys Leu Gly 

260 265 270 

Gly His Asp Ser Trp Arg Leu His lie His Gly Ala Lys Asp Gin Val 
275 280 285 

Arg Phe Leu Arg His Val Gly Val His Gly Ala Glu Ala Val Ala Ala 
290 295 300 

Gin Glu Met Leu Arg- Gin Leu Lys Gly Pro Val Arg Asn Pro Asn Leu 
305 310 315 320 

Asp Ser Ala Pro Lys Lys Val Trp Ala Gin Val Arg Asn Arg Leu Ser 

325 330 335 

Ala Lys Gin Met Met Asp lie Gin Leu His Glu Pro Thr Met Trp Lys 

340 345 350 



His Ser Pro Ser Arg Ser Arg Pro His Arg Ala Glu Ala Arg He Glu 
355 360 365 
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Asp Arg Ala He 
370 

Val Val Glu lie 
385 

Val Ser Gly Thr 



His Glu Leu Ala 
375 

Thr Ser lie Gly 
390 

His Asn Phe Val 
405 



Arg Gly Asp Ala 

380 

Asp Gin His Val 
395 

Ala Asn Gly lie 
410 



Tyr Trp Asp Thr 



Phe Asp Gly Thr 

400 

Ser Leu His Asn 
415 



<210> 14 
<211> 8 
<212> PRT 

<213> Mycobacterium xenopi Gyrase A 
<400> 14 

Asp Ser Ala Ala Ala Met Arg Tyr 
1 5 



<210> 15. 
<211> 31 
<212> DNA 

<213> Escherichia coli Gyrase A 
<400> 15 

gataggctag cgatgagcga ccttgcgaga g 

<210> 16 
<211> 32 
<212> DNA 

<213> Escherichia coli Gyrase A 
<400> 16 

tgaagcaatt gaattattct tcttctggct eg 

<210> 17 
<211> 32 
<212> DNA 

<213> Nocardia otitidis-caviarum 



31 



32 



<400> 17 

cggcgactct gcggccgcaa tgcgttatac gg 



32 
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<210> 18 
<211> 32 
<212> DNA 

<213> Nocardia oti tidis-caviarum 
<400> 18 

ccgtataacg cattgcggcc gcagagtcgc eg 32 

<210> 19 
<211> 31 
<212> DNA 

<213> Xanthomonas badrii 



<210> 20 
<211> 31 
<212> DNA 

<213> Xanthomonas badrii 
<400> 20 

ccgtctcttt ttctagagcg gecatcagtt c 31 

<210> 21 
<211> 61 
<212> DNA 

<213> Bacillus lentus 
<400> 21 

ggccgcaatg cgttatacgg aaatccgett agegaaaatt gcccatgaac tgatggccga 60 
t 61 

<210> 22 
<211> 60 
<212> DNA 

<213> Bacillus lentus 
<400> 22 

etagategge atcagttcat gggcaatttt egctaagegg atttcegtat aacgeattge 60 

<210> 23 
<211> 39 
<212> DNA 

<213> Mycobacterium xenopi Gyrase A 



<400> 19 

gaactgatgg ccgctctaga aaaagagacg g 



31 



<400> 23 



cgacccgcgc ggccgcaatg cgttattgca teaegggag 



39 



12 
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<210> 24 
<211> 45 
<212> DNA 

<213> Mycobacterium xenopi 
<400> 24 

gccaaaggcg ctaagcggat ttccgtgttg tggctgacga acccg 45 

<210> 25 
<211> 31 
<212> DNA 

<213> Streptomyces phaeochromogenes 
<400> 25 

atgggcatgc atatatatat aggcctgggc c 

<210> 26 
<211> 30 
<212> DNA 

<213> Streptomyces phaeochromogenes 
<400> 26 

caggcctata tatatatgca tgcccattcg 

3 0 

<210> 27 
<211> 39 
<212> DNA 

<213> Streptomyces griseoruber 
<400> 27 

gtttaagtct tgcttgcgat cgcttggcta tgacctgcc 39 

<210> 28 
<211> 38 
<212> DNA 

<213> Streptomyces griseoruber 
<400> 28 

gcctgacccc gaatttgaat cgattcttga cactgttg 38 

<210> 29 
<211> 38 
<212> DNA 

<213> Caryophanon latum 



<400> 29 

gcctgacccc gaatttgaat cgattcttga cactgttg 



38 



13 
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<210> 30 
<211> 38 
<212> DNA 

<213> Caryophanon latum 
<400> 30 

caacagtgtc aagaatcgat tcaaattcgg ggtcaggc 

<210> 31 
<211> 34 
<212> DNA 

<213> Gallus gallus alpha-spectrin 
<400> 31 

aatggtgcat gcaaggagat ggcgcccaac agtc 

<210> 32 
<211> 41 
<212> DNA 

<213> Gallus gallus alpha-spectrin 
<400> 32 

gctttggcta gctttcctgt gtcacctgct gatcatgaac 

<210> 33 
<211> 29 
<212> DNA 

<213> Proteus vulgaris 
<400> 33 

gcgtaaagct cgcgaccgtg ctcatatcc 

<210> 34 
<211> 29 
<212> DNA 

<213> Proteus vulgaris 
<400> 34 

ggatatgagc acggtcgcga gctttacgc 

<210> 35 
<211> 53 
<212> DNA 

<213> Gallus gallus alpha-spectrin 
<220> 

<223> ((W)NN)7-12 = synthetic randon oligo 



14 
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3 



0= 



<220> 

<223> At position 38, "W" = A or T 
<220> 

<223> At position 39 and 40, "N" = g, C, A or T 
<400> 35 

tgtcaagaat cgattcaaat tcggggtcag gctctccwnn atagccaagc gat 53 

<210> 36 
<211> 11 
<212> DNA 

<213> Gallus gallus alpha-spectrin 



<400> 36 
cgcttggcta t 

<210> 37 
<211> 31 
<212> DNA 

<213> Mycobacterium tuberculosis 
<400> 37 

aggtgagaat tcatggcggt cgttgatgac c 

<210> 38 
<211> 36 
<212> DNA 

<213> Mycobacterium tuberculosis 
<400> 38 

tatataaagc tttcatgtca ccgagccatg ttggcg 

<210> 39 
<211> 31 
<212> DNA 

<213> Mycobacterium tuberculosis 
<400> 39 

aggtgagaat tcatggcggt cgttgatgac c 

<210> 40 
<211> 33 
<212> DNA 

<213> Mycobacterium tuberculosis 
<400> 40 



11 



31 



36 



31 



15 
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tttcccacgc ccgggcacgc cgccacgatg acc 



33 



<210> 41 
<211> 32 
<212> DNA 

<213> Acetobacter aceti 
<400> 41 

gccgccgatc .cgcgacatcg tagatttcgg cc 32 

<210> 42 
<211> 32 
<212> DNA 

<213> Acetobacter aceti 



<210> 43 
<211> 89 
<212> DNA 

<213> Gallus gallus alpha-spectrin 
<220> 

<223> ((W)NN)7-12 = synthetic randon oligo 
<220> 

<223> At position 38, "W" = A or T 
<220> 

<223> At position 39 and 40, "N" = A, G, C or T 
<400> 43 

tgtcaagaat cgattcaaat tcggggtcag gctctccwnn atagccaagc gatcgcaggc 60 
agcttttaaa gccctgatgg ttcagacgt 89 

<210> 44 
<211> 15 
<212> DNA 

<213> Gallus gallus alpha-spectrin 



<400> 42 



ggccgaaatc tacgatgtcg cggatcggcg gc 



32 



<400> 44 



ctgaaccatc agggc 



15 



<210> 45 
<211> 7 
<212> PRT 

<213> Mycobacterium xenopi Gyrase A 



WO 01/32831 



<400> 45 

Glu lie Arg Leu Ala Lys lie 

1 5 



PCT/US00/29596 



<210> 46 
<211> 199 
<212> PRT 

<213> Mycobacterium xenopi Gyrase A 
<400> 46 

Cys lie Thr Gly Asp Ala Leu Val Ala Leu Pro Glu Gly Glu Ser Val 

5 10 15 



Arg lie Ala Asp He Val Pro Gly Ala Arg Pro Asn Ser Asp Asn Ala 

30 



20 25 



He Asp Leu Lys Val Leu Asp Arg His Gly Asn Pro Val Leu Ala Asp 
35 40 45 

Arg Leu Phe His Ser Gly Glu His Pro Val Tyr Thr Val Arg Thr Val 
50 55 60 

Glu Gly Leu Arg Val Thr Gly Thr Ala Asn His Pro Leu Leu Cys Leu 

65 ? 0 75 

'=> 80 



Val Asp Val Ala Gly Val Pro Thr Leu Leu Trp Lys Leu II 



85 90 



e Asp Glu 
95 



He Lys Pro Gly Asp Tyr Ala Val He Gin Arg Ser Ala Phe 



100 105 



Ser Val 
110 



Asp Cys Ala Gly Phe Ala Arg Gly Lys Pro Glu Phe Ala Pro Thr Thr 

125 



H5 120 



TYr Thr Val Gly Val Pro Gly Leu Val Arg Phe Leu Glu Ala Hi 



130 135 



His His 
140 



Arg Asp Pro Asp Ala Gin Ala Il e Ala Asp Glu Leu Thr Asp Gly Arg 
145 150 155 16 o 

Phe Tyr Tyr Ala Lys Val Ala Ser Val Thr Asp Ala Gly Val Gin 



165 170 



Pro 
175 



Val Tyr Ser Leu Arg Val Asp Thr Ala Asp His Ala Phe II 



180 185 



e Thr Asn 
190 
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Gly Phe Val Ser His Asn Thr 
195 
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